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DIAGNOSTIC! METHOD 

Technical FipIH 

The inventors have discovered a human gene linked to susceptibility to inflammatory 
bowel disease (JED) using linkage and association analysis. The present invention therefore 
5 relates to diagnostic techniques for the detection of IBD, and for determining a patient's 
susceptibility to develop IBD by detecting all or part of this gene, its precursors or products 
(e.g. mRNA, cDNA, genomic DNA, or protein). The invention is also directed to methods 
for identifying modulators of IBD, which modulators, such as chemical compounds, antisense 
molecules and antibodies modulate the gene identified. 
10 Background to Invention 

Inflammatory bowel disease (IBD) is characterised by a chronic relapsing intestinal 
inflammation of the gastrointestinal tract It affects -1/1,000 individuals in Western countries 
with the median age of onset in early adulthood. To date, the etiology of this disease is 
unknown. Based on clinical and histopathological features, IBD is categorised into two main 

15 subtypes, Crohn's disease (CD) (On Line Mendelian Inheritance in Man - a database 
produced by Johns Hopkins University available at NCBI, OMIM 266600) and ulcerative 
colitis (UC) (OMIM 191390). Although the cause of IBD is unknown, both familial clustering 
of the disease and increased-concordance in monozygotic twins shows a strong genetic 
susceptibility. Estimates of sibling risk (Xs) show a range of 10-50, suggesting that genetic 

20 factors play a significant role in the predisposition to IBD. In the present context the term IBD 
is intended to include IBD, as weU as Crohn's disease and ulcerative colitis. 

Previous genome wide linkage analyses have identified a number of susceptibility locus 
for IBD, e.g. IBD1 (OMIM 266600) (Hugot et al., Nature. 379:821-823, 1996; Brant et al, 
Gastroenterlogy 115:1056-1061, 1998; Curran et al., Gastroenterology 115:1066-1071, 1998; 

25 and, Hampe et al, Am J. Hum Genet. 64:808-816, 1999a), IBD 2 (OMIM 601458) (Duerr et 
al, Am J. Hum Genet 63:95-100, 1988; and, Parkes etal, Am J. Hum Genet. 67:1605- 
1610, 2000), IBD3 (OMIM 604519) (Hampe etal, Am J. Hum Genet 65:1647-1655, 
1999b), IBD7 (OMIM 605225) (Cho etal, Proc. Nat. Acad. Sci. 95:7502-7507; and, Cho et 
al, Hum Molec. Genet. 9:1425-1432, 2000). 

30 There is therefore a desire to identify genes with a significant association to the 
development of IBD. This may enable the development of novel therapies for IBD by 
screening for^ompounds and other entities, such as antibodies, which modulate the activity of 
the proteins encoded by the associated genes. Knowledge of the sequence of the associated 
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genes may also enable the development of novel antigene methods to modulate the expression 
of the associated gene and may also enable the development of novel gene therapy techniques 
to treat IBD. The discovery of associated genes may also assist in developing novel methods 
for diagnosing IBD via (1) analysis of the pattern of genotypes of associated single nucleotide 
5 polymorphisms (SNPs), (ii) measuring the levels of the transcribed mRNA present in affected 
tissue or (iii) measuring the levels of the protein in affected tissue. It is possible that the 
diagnosis of IBD, or the prediction of predisposition to IBD, by these methods may be 
achieved in patients who do not yet display the classical symptoms of the disease. Such 
determination of susceptibility to IBD or the early detection of disease development may lead 

10 to earlier clinical intervention than is currently posstblB^i^ayieadno^ore-effeetive 

treatment of the disease. 

The present invention is based on our discovery of an association with IBD for a single 
gene termed dlg5 located on chromosome 10q22.3. 

As used herein, the gene is referred to as the dlg5 gene. Specifically, the cDNA 
15 sequence is shown in SEQ ID No: 1. Encoded protein is shown in SEQ ED No: 2 and is 

referred to as DLG5. A C-terminally truncated cDNA sequence is shown in SEQ ID NO: 189 
and its encoded protein is shown in SEQ ID NO: 190. 

DLG5 belongs to the so-called MAGUK family of proteins (Membrane Associated 
Guanylate Kinases, reviewed in Dimitratos et al BioEssays 21:912-921, 1999). Proteins of 
20 this family contain several distinct protein motifs including a guanylate kinase domain, one or 
several PDZ domains (Postsynaptic density 95, Discs large, Zona occludens-1 domain) and a 
SH3 domain (src homology domain 3). PDZ domains and SH3 domains have been shown to 
mediate protein-protein interactions. In several cases where PDZ domain interactions have 
been characterised they have been shown to interact with short C-terminal sequences of 
25 membrane proteins (Kreienkamp, Curr. Opin. Pharm. 2:581-586, 2002). SH3 domains have 
been found to interact with proline rich surface regions of target proteins. Since the guanylate 
kinase domain for some MAGUK proteins has been shown to mediate protein-protein 
interactions while it lacks kinase activity, it is generally believed that the main function also 
- . for this domain is tpjnediate protein-protein interactions. Therefore, MAGUK proteins are 
30 considered as scaffold proteins, orchestrating signalling molecules to specific membrane 

locations. Besides establishment of cell polarity of epithelia, MAGUK proteins have also been 
implicated in establishment of postsynaptic compartments in neurons (Kreienkamp, Curr. 
Opin. Phaim 2:581-586, 2002). R>r example, an interaction between a PDZ domain of the 
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MAGUK protein hDLGl^PSD-95 and the intraceUular tail of the NMDA receptor has been 
identified. Since it has been identified that the C-terrnini of some 50 intracellular and 
membrane proteins have high affinity for the PDZ domains of hDLG-l/PSD-95, it has been 
hypothesised that such clustering of scaffold proteins to multiple membrane receptors is 
5 responsible for localisation of neuroreceptors at postsynaptic sites (Kreienkamp, Curr. Opin 
Pharm, 2:581-586, 2002). 

Until recently, little has been known about the function of DLG5. Partial EST sequences 
for dlg5 were identified from a database search by its similarity to other MAGUK proteins 
(Nakamura et al, FBBS 433:63-67, 1998). The authors identified a partial cDNA sequence 
10 referred to as P-dlg and showed by irnmunostaining that the protein was expressed in 
epithelial gland cells of the prostate. It was also shown by a two-hybrid screen that the 
DLG5/P-dlg specifically interacted withp55, another MAGUK protein. Northern blot 
analyses showed variable expression in multiple tissues (Nakamura et al, FEBS 433:63-67, 
1998 and Shah et al, BMC Genomics 3:6 2002). Shah et al. also showed that the human 
15 gene consisting of 32 exons, encoded a full length DLG5 protein of 1809 amino acids. The 
DLG5 protein contains 4 PDZ domains followed by an SH3 domain and a C terminal 
guanylate kinase domain. 

It was recently shown that DLG5 could be identified in a two-hybrid screen using 
vinexin as bait (Wakabayashi et al, JBC, 25th March 20032003). Furthermore, the authors 
20 showed that vinexin, DLG5 and P-catenin could form a ternary complex, providing a direct 
link to the adhesion junction complex in epithelial cells. 

In vertebrate gut epithelial cells three types of cell junctions are formed (reviewed in 
Tsukita et al,. Nature Rev. Moi Cell. Biol. 2:285-293, 2001). Tight junctions are located 
towards the apical border of the basolateral side and are considered to function both as a 
25 barrier for the extracellular environment as well as a fence for membrane diffusion 

Adherence junctions are formed immediately basolateral of tight junctions and their role is 
less clear than for tight junctions. They are considered to be important for the mechanical 
strength of cell contacts, but it is also clear that their regulation has to be precisely 
coordinated with tight junctions, for example when immune cells passes through the epithelial 
30 barrier. The mutual dependence between tight junctions and adherence junctions are 
underscored by the findings that while formation of tight junctions does not occur until 
adherence junctions are intact, adherence junctions can not form when formation of tight 
junctions are inhibited by overexpression of a dominant negative mutant of the tight junction 
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MAGUK protein ZO-3 <Wittchen et al, J. CeU. BioL 151:825-836, 2000; and refs therein). 
Finally, multiple desmo somes are located along the basolateral sides and are mainly 
considered to contribute to the mechanical strength of cell contacts. 

Many proteins have been shown to be localised to cellular junctions. Membrane 
5 proteins, such as for example occludin and claudins, are found at tight junctions while 
members of the cadherin family mediates cell-cell contacts at adherence junctions. A large 
number of proteins connect to the cytoplasmatic side of these membrane proteins, linking the 
complexes both to the actin cytoskeleton and to intracellular signaling. At adherence 
junctions, the cytoplasmatic part of cadherin binds p-catenin, thus providing a link between 
10 DLG5 and adherence junctions. 

The background data l above strongly supports a functional role for DLG5 in gut 
epithelial cell function and integrity. The inventors propose that protein(s) encoded by the 
dlg5 gene, which has only now been identified as being genetically linked to susceptibility to 
IBD, are directly or indirectly involved in the pathogenesis of gut inflammation. 
15 The inventors have- identified 20 unique nucleotide variations within the dlg5 gene, four 

of these result in codon changes, a further two are deletions. 
Summary of the invention 

The inventors have identified a gene located on chromosome 10q22.3, termed dlg5, 
which demonstrates genetic association linkage to susceptibility to IBD. The gene, mRNA 
20 (or cDNA prepared therefrom) and protein sequences corresponding to such transcript are 
therefore diagnostic or prognostic markers of IBD, and can be used to design specific probes, 
or to generate antibodies, capable of detecting the presence of nucleotide sequence 
polymorphims or mutations of the gene or ihRNA, or of measuring the levels of the mRNA or 
encoded protein present in a test sample, such as a body fluid or cell sample. In addition the 
25 gene and protein encoded thereby is a potential target for therapeutic intervention in IBD 
disease, for instance in the development of antisense nucleic acid targeted to the mRNA, or 
transgenic therapies; or more widely in the identification or development of chemical or 
hormonal therapeutic agents. The person skilled in the art is also capable of devising 
screening "assays'to identify*compounds*(chemical or biological) that modulate (activate.on. . 
30 inhibit) the identified gene or encoded protein, which compounds may prove useful as 
therapeutic agents in treating or preventing IBD. 
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Detailed description nf the mvf^Hrm 



According to a first aspect of the invention there is provided a method for identifying a 
compound capable of modulating the action of the DLG5 protein which method comprises 
subjecting one or more test compounds to a screen comprising a polypeptide containing the 
5 ammo acid sequence shown in SEQ ID NO: 2, or a homologue thereof or a fragment of either. 

The term "fragment" as used herein refers to a subsequence of the full length sequence 
that comprises at least 25, preferably at least 50, more preferably at least 100 consecutive 
amino acids of the sequence depicted in SEQ ID NO: 2, preferably the fragment is a 
polypeptide that is the DLG5 protein with either or both C-terminal and N-terminal 
0 truncations, such as the polypeptide depicted in SEQ ID NO: 190. 

It is understood mat the polypeptide for use in the invention may be both a fragment and 
a homologue of the DLG5 protein. 

In a preferred embodiment, the screening methods of the invention are carried out using 
a polypeptide comprising an amino acid sequence as depicted in SEQ ID NO: 2, or a sequence 
5 possessing, in increasing order of preference, at least 80%, 85%, 90%, 95%, 97%, 98% and 
99% amino acid sequence identity thereto. Such variants are herein referred to as 
"homologues". 

The sequence identitybetween two sequences can be determined by pair-wise computer 
alignment analysis, using programs such as, BestFit, Gap or FrameAlign. The preferred 
0 ahgnment tool is BestFit. In practise, when searching for similar/identical sequences to the 
query search, from within a sequence database, it is generally necessary to perform an initial 
identification of similar sequences using suitable software such as Blast, Blast2, NCBI Blast2 
WashU Blast2, FastA, Fasta3 and PILEUP, and a scoring matrix such as Blosum 62. Such 
software packages endeavour to closely approximate the "gold-standard" ahgnment algorithm 
of Smith-Waterman. Thus, the selected software/search engine programme for use in 
assessing identity/similarity, i.e how two primary polypeptide sequences line up is Smith- 
Waterman. Identity refers to direct matches, similarity allows for conservative substitutions. 

Allelic variants or versions of the DLG5 protein may exist within the human population 
particularly between distinct ethnic groups. A further aspect of the invention involves the 
selection and use of the appropriate version of the DLG5 protein to be included in screens so 
as to discover compounds capable of altering the activity of said DLG5 version,* vivo The 
inventors have identified four codon changing nucleotide polymorphisms within one or other 
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exons of dlg5 gene, each of these, alone or in combination, would provide numerous allelic 
variant protein versions of DLG5 for use in any aspect of the present invention. 

Investigators may wish to screen their compounds against the most prevalent version of 
the DLG5 protein and also against the less frequent versions of the DLG5 protein in order to 
5 detect any differential pharmacological activity between the various versions of the target. A 
further aspect of the invention is the screening of various ethnic based populations to measure 
the allele frequencies of the nucleotide polymorphisms in the dlg5 gene within said 
populations. This information may be of value in estimating the efficacy of new compounds 
capable of altering the activi ty of DLG5 within these populations and in particular in 

10 estimating the proportion of the population which may not respond to the therapy. " 

Polymorphism refers to the occurrence of two or more genetically determined 
' alternative alleles or sequences within a population. A polymorphic marker is the site at 
which divergence occurs. Preferably markers have at least two alleles, each occurring at 
frequency of greater than 1%, and more preferably at least 10%, 15%, 20%, 30% or more of a 
15 selected population. 

Single nucleotide polymorphisms (SNP) are generally, as the name implies, single 
nucleotide or point variations that exist in the nucleic acid sequence of some members of a 
species. Such polymorphism variation within the species, is generally regarded to be the 
result of spontaneous mutation throughout evolution. The mutated and normal sequences co- 
20 exist within the species' population sometimes in a stable or quasi-stable equilibrium. At 
other times the mutation may confer some advantage to the species and with time may be 
incorporated into the genomes of all or a majority of members of the species. 

Some SNPs occur in the protein coding sequences, in which case, one of the 
polymorphic protein forms may possess a different amino acid which may give rise to the 
25 expression of a variant protein and, potentially, a genetic disease. These changes in function 
may be mediated by several mechanisms including, but not limited to, alterations in protein 
folding, alterations in ligand and substrate binding affinity and alterations in membrane 
binding affinity and may lead to gain of activity or loss of activity for the protein in vivo. 
Such-alterations in the activity of the. protein in vivomay.be ofjclinical significance in the 
30 development of IBD. Alteration to the amino acid sequence of the protein may also affect the 
efficacy of drug therapy for IBD by altering the specificity between protein and compounds 
selected by screening to modulate its activity. Thus compounds selected by screening may 
have different efficacies in modulating the activity of protein in different individuals 

BEST AVAILABLE COPY 




101073-2 

• 

according to the versions of the gene that they carry. In particular an individual who is 
homozygous for a less common variant of the gene may not respond well to a therapy 
developed by screening compounds against the dominant variant. 

The screening methods according to the invention may be carried out using conventional 

5 procedures, for example by bringing the test compound or compounds to be screened and an 
appropriate substrate into contact with the polypeptide, or a cell capable of producing it, or a 
cell membrane preparation thereof, and determining affinity for the polypeptide in accordance 
with standard techniques. 

Any compound identified in this way may prove useful in the treatment of IBD in 

10 humans and/or other animals. The invention thus extends to a compound selected through its 
ability to regulate the activity of the DLG5 protein in vivo as primarily determined in a 
screening assay utilising the polypeptide containing an amino acid sequence shown in SEQ 
ID NO: 2, or a homologue or fragment thereof, or a gene coding therefore (such as that 
disclosed in SEQ ID NO: 1) for use in the treatment of a disease in which the over- or under- 

15 activity or unregulated activity of the protein is implicated. 

According to a further aspect of the invention there is provided a screening assay or 
method for identifying potential anti-IBD therapeutic compounds comprising contacting an 
assay system capable of detecting the effect of a test compound against expression level of 
DLG5, with a test compound and assessing the change in expression level of DLG5. 

20 Compounds that modulate the expression of DNA or RNA of DLG5 polypeptides may 

be detected by a variety of assay systems. A suitable assay system may be a simple "yes/no" 
assay to determine whether there is a change in expression of a reporter gene, such as beta- 
galactofcidase, luciferase, green fluorescent protein or others known to the person skilled in the 
art (reviewed by Naylor, Biochem. Pharmacol. 58:749-57, 1999). The assay system may be 

25 made quantitative by comparing the expression or function of a test san^le with the levels of 
expression or function in a standard sample. Systems in which transcription factors are used 
to stimulate a positive output, such as transcription of a reporter gene, are generally referred to 
as "one-hybrid systems" (Wang, M.M. and Reed, R.R. (1993) Nature 364:121-126). Using a 
transcription factor to stimulate a negative output (growth inhibition) may thus be referred to 

30 as a Reverse one-hybrid system" (Vidal et al, 1996, supra). Therefore, in an embodiment of 
the present invention, a reporter gene is placed under the control of the dlg5 promoter. A 
suitable dlg5 promoter sequence is disclosed in SEQ ID NO: 5. 
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In a further aspect of the invention we provide a cell or cell line comprising a reporter 
gene under the control of the dlg5 promoter. 

According to another aspect of the present invention there is provided a method of 
screening for a compound potentially useful for treatment of IBD, which comprises assaying 
5 the compound for its ability to modulate the activity or amount of DLG5. Preferably the 
assay is selected from 

(i) measurement of DLG5 activity using a cell line which expresses the DLG5 polypeptide or 
using purified DLG5 polypeptide; and 

(ii) measurement of dl g5 transcription or translation in a cell line expressing the DLG5 

10 polypeptide. ~~~~™ 

The "DLG5 polypeptide" refers to the DLG5 protein, a homologue thereof, or a 
fragment of either. 

Thus, in a further aspect of the invention, cell cultures expressing the DLG5 polypeptide 
~ can be ^ a screen for therapeutic agents. Effects of test compounds may be assayed by 
U5 changes in mRNA or protein of DLG5. As described below, cells (i.e. mammalian, bacterial, 
yeast etc.) can be engineered to express the DLG5 polypeptide. 

Thus, according to a further aspect of the invention there is provided a method of testing 
potential therapeutic agents for the ability to suppress IBD phenotype comprising contacting a 
test compound with a cell engineered to express the DLG5 polypeptide; and detennining 
20 whether said test compound modulates expression of the DLG5 polypeptide. 

We also provide a method for identifying inhibitors of transcription of dlg5, which 
method comprises contacting a potential therapeutic agent with a cell or cell line as described 
above and determining inhibition of dlg5 transcription by the potential therapeutic agent by 
reference to a lack of or reduction in expression of the reporter gene. 
25 Any convenient test compound or library of test compounds may be used in conjunction 
with the test assay. Particular test compounds include low molecular weight chemical 
compounds (preferably with a molecular weight less than 1500 daltons) suitable as 
pharmaceutical or veterinary agents for human or animal use, or compounds for non- 
administered use such-as cleaning/sterilising agents or for agricultural use. Test compounds- - 
30 may also be biological in nature, such as antibodies. 

According to a further aspect of the invention there is provided a compound identified 
by a screening method as defined herein. 
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According to another aspect of the present invention there is provided use of a 
compound able to modulate the activity or amount of DLG5 in the preparation of a 
medicament for the treatment of IBD. Modulation of the amount of DLG5 by a compound 
may be brought about for example through altered gene expression level or message stability. 
5 Modulation of the activity of DLG5 by a compound may also be brought about for example 
through compound binding to the DLG5 protein. In one embodiment, modulation of DLG5 
comprises use of a compound able to reduce the activity or amount of DLG5. In another 
embodiment, modulation of DLG5 comprises use of a compound able to increase the activity 
or amount of DLG5. 

10 It will be appreciated that the term 'for the treatment of IBD', and variations thereon, 

includes therapeutic and prophylactic (preventative) treatment. 

According to another aspect of the present invention there is provided a method of 
preparing a pharmaceutical composition which comprises: 

i) identifying a compound as useful for treatment of IBD according to a screening method as 
15 described herein; and 

ii) mixing the compound or a pharmaceutically acceptable salt thereof with a 
pharmaceutically acceptable excipient or diluent. 

According to a further aspect of the invention there is provided a method of treatment of 
a patient suffering from IBD comprising administration to said patient of an effective amount 

20 of a compound identified according to a screening method of the invention or a composition 
prepared by the method described herein. 

The compositions of the invention may be in a form suitable for oral use (for example as 
tablets, lozenges, hard or soft capsules, aqueous or oily suspensions, emulsions, dispersible 
powders or granules, syrups or elixirs), for topical use (for example as creams, ointments, 

25 gels, or aqueous or oily solutions or suspensions), for administration by inhalation (for 

example as a finely divided powder or a liquid aerosol), for administration by insufflation (for 
example as a finely divided powder) or for parenteral administration (for example as a sterile 
aqueous or oily solution for intravenous, subcutaneous, intramuscular or intramuscular dosing 
or as a suppository for rectal dosing). 

30 The compositions of the invention may be obtained by conventional procedures using 

conventional pharmaceutical excipients, well known in the art. Thus, compositions intended 
for oral use may contain, for example, one or more colouring, sweetening, flavouring and/or 
preservative agents. 
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Suitable pharmaceutical^ acceptable excipients for a tablet formulation include, for 
example, inert diluents such as lactose, sodium carbonate, calcium phosphate or calcium 
carbonate, granulating and disintegrating agents such as com starch or algenic acid; binding 
agents such as starch; lubricating agents such as magnesium stearate, stearic acid or talc; 
5 preservative agents such as ethyl or propyl p-hydroxybenzoate, and anti-oxidants, such as 
ascorbic acid. Tablet formulations may be uncoated or coated either to modify their 
disintegration and the subsequent absorption of the active ingredient within the 
gastrointestinal track, or to improve their stability and/or appearance, in either case, using 
conventional coating agents and procedures well known in the art. 



10 Compositions for oral use may be in the form of hard gelatin capsules in which the 

active ingredient is mixediwith an inert solid diluent, for example, calcium carbonate, calcium 
phosphate or kaolin, or raoft gelatin capsules in which the active ingredient is mixed with 
water or an oil such as peanut oil, liquid paraffin, or olive oil. 

Aqueous suspensions generally contain the active ingredient in finely powdered form 

15 together with one or morenmspending agents, such as sodium carboxymethylcellulose, 

methylcellulose, hydroxypropylmethylcellulose, sodium alginate, polyvinylpyrrolidone, gum 
tragacanth and gum acacia; dispersing or wetting agents such as lecithin or condensation 
products of an alkyleneuxide with fatty acids (for example polyoxethylene stearate), or 
condensation products "of ethylene oxide with long chain aliphatic alcohols, for example 

20 heptadecaethyleneoxycetanol, or condensation products of ethylene oxide with partial esters 
derived from fatty acids und a hexitol such as polyoxyethylene sorbitol monooleate, or 
condensation products orethylene oxide with long chain aliphatic alcohols, for example 
heptadecaethyleneoxycetanol, or condensation products of ethylene oxide with partial esters 
derived from fatty acids and a hexitol such as polyoxyethylene sorbitol monooleate, or 

25 condensation products of ethylene oxide with partial esters derived from fatty acids and 

hexitol anhydrides, for example polyethylene sorbitan monooleate. The aqueous suspensions 
may also contain one or more preservatives (such as ethyl or propyl p-hydroxybenzoate, anti- 
oxidants (such as ascorbic acid), colouring agents, flavouring agents, and/or sweetening 
agents (such-as sucrose, saccharine or aspartame): 

30 Oily suspensions may be formulated by suspending the active ingredient in a vegetable 
oil (such as arachis oil, olive oil, sesame oil or coconut oil) or in a mineral oil (such as liquid 
paraffin). The oily suspensions may also contain a thickening agent such as beeswax, hard 
paraffin or cetyl alcohoL Sweetening agents such as those set out above, and flavouring 
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agents may be added to provide a palatable oral preparation. These compositions may be 
preserved by the addition of an anti-oxidant such as ascorbic acid. 

Dispersible powders and granules suitable for preparation of an aqueous suspension by 
the addition of water generally contain the active ingredient together with a dispersing or 
5 wetting agent, suspending agent and one or more preservatives. Suitable dispersing or 
wetting agents and suspending agents are exemplified by those already mentioned above. 
Additional excipients such as sweetening, flavouring and colouring agents, may also be 
present. 

The pharmaceutical compositions of the invention may also be in the form of 
10 oil-in- water emulsions. The oily phase may be a vegetable oil, such as olive oil or arachis oil, 
or a mineral oil, such as for-example liquid paraffin or a mixture of any of these. Suitable 
emulsifying agents may bejrfor example, natnraUy-occrurring gums such as gum acacia or gum 
tragacanth, naturally-occurring phosphatides such as soya bean, lecithin, an esters or partial 
esters derived from fatty, acids and hexitol anhydrides (for example sorbitan monooleate) and 
15 condensation products of the said partial esters with ethylene oxide such as polyoxyethylene 
sorbitan monooleate. sThe emulsions may also contain sweetening, flavouring and 
preservative agents. 

Syrups and elixirs may be formulated with sweetening agents such as glycerol, 
propylene glycol, sorbitol, -aspartame or sucrose, and may also contain a demulcent, 
20 preservative, flavouring and/or colouring agent. 

The pharmaceutical compositions may also be in the form of a sterile injectable aqueous 
or oily suspension, which may be formulated according to known procedures using one or 
more of the appropriate dispersing or wetting agents and suspending agents, which have been 
mentioned above. A sterile injectable preparation may also be a sterile injectable solution or 
25 suspension in a non-toxic parenterally- acceptable diluent or solvent, for example a solution in 
1,3-butanedioL 

Suppository formulations may be prepared by mixing the active ingredient with a 
suitable non-irritating excipient, which is solid at ordinary temperatures but liquid at the rectal 
temperature and will therefore melt in the rectum to release the drug. Suitable excipients 
30 include, for example, cocoa butter and polyethylene glycols. 

Topical formulations, such as creams, ointments, gels and aqueous or oily solutions or 
suspensions, may generally be obtained by formulating an active ingredient with a 
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conventional, topically acceptable, vehicle or diluent using conventional procedure well 
known in the art. 

Compositions for administration by insufflation may be in the form of a finely divided 
powder containing particles of average diameter of, for example, 30u. or much less, the 
5 powder itself comprising either active ingredient alone or diluted with one or more 
physiologically acceptable carriers such as lactose. The powder for insufflation is then 
conveniently retained in a capsule containing, for example, 1 to 50mg of active ingredient for 
use with a turbo-inhaler device, such as is used for insufflation of the known agent sodium 
cromoglycate. 

10 Compositions for administration by inhalation may be in the form of a conventional 

pressurised aerosol arranged to dispense the active ingredient either as an aerosol containing 
finely divided solid or liquid droplets. Conventional aerosol propeUants such as volatile 
fluorinated hydrocarbons or hydrocarbons may be used and the aerosol device is conveniently 
arranged to dispense a metered quantity of active ingredient. 

15 For further information on Formulation the reader is referred to Chapter 25.2 in Volume 

5 of Comprehensive Medicinal Chemistry (Corwin Hansch; Chairman of Editorial BIBDrd), 
Pergamon Press 1990. 

The amount of active ingredient that is combined with one or more excipients to 
produce a single dosage form will necessarily vary depending upon the host treated and the 

20 particular route of administration For example, a formulation intended for oral 

administration to humans will generally contain, for example, from 0.5 mg to 2 g of active 
agent compounded with an appropriate and convenient amount of excipients which may vary 
from about 5 to about 98 percent by weight of the total composition Dosage unit forms will 
generally contain about 1 mg to about 500 mg of an active ingredient. For further information 

25 on Routes of Administration and Dosage Regimes the reader is referred to Chapter 25.3 in 
Volume 5 of Comprehensive Medicinal Chemistry (Corwin Hansch; Chairman of Editorial 
BIBDrd), Pergamon Press 1990. 

The size of the dose for therapeutic or prophylactic purposes of a compound will 
— - naturally vary according to the nature-and-severity of the conditions, the age and sex of the 

30 animal or patient and the route of administration, according to well known principles of 
medicine. 

In using a compound for therapeutic or prophylactic purposes it will generally be 
administered so that a daily dose in the range, for example, 0.5 mg to 75 mg per kg body 
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weight is received, given if required in divided doses. In general lower doses will be 

administered when a parenteral route is employed Thus, for example, for intravenous 

administration, a dose in the range, for example, 0.5 mg to 30 mg per kg body weight will 

generally be used. Similarly, for administration by inhalation, a dose in the range, for 

5 example, 0.5 mg to 25 mg per kg body weight will be used. Oral administration is however 
preferred. 

Having identified that the dlg5 gene is implicated in IBD, this presents many molecular 
diagnostic opportunities. It is known to persons skilled in the art that clinically significant 
information may be obtained by the measurement of the levels of nucleic acids, proteins or 
10 other analytes that occur within biological samples. When nucleic acids, proteins or other 
analytes occur in polymorphic form then there may aiso be diagnostic utility in by identifying 
which of the various versions of said polymorphic nucleic acids, proteins or other analytes 
occur within a sample. 

An investigator may wish to measure the levels of DLG5 protein or to measure the 
15 levels of dlg5 mRNA transcript present in a sample. An investigator may also wish to 
perform nucleic acid sequence analyses to detect variant nucleotides present within the 

- sample, these analyses may be performed on either the DNA or RNA fraction of the sample 

- and are well known to the person skilled in the art. An investigator may also wish to perform 
protein sequence analysis either directly, by degradation based techniques which are well 

20 known in the art, or indirectly by molecular recognition techniques including immunoassay, 
or by techniques based on detecting changes in the physical characteristics of the protein such 
as functional or substrate specificity assays or iso-electric focusing. 

According to a further aspect of the invention there is provided a method for diagnosing 
or prognosing or monitoring IBD, comprising testing a biological sample for aberrant levels 

25 of DLG5. 

The term "aberrant levels" refers to levels that are outside the normal range. The normal 
range can be determined by testing many normal tissues or may be determined from a side by 
side comparison of the test sample with the normal or control sample. For the purposes of 
this application, aberrant expression refers to a 1.5 -fold difference or more in level of nucleic 
30 acid in a disease sample compared to control normal. Nucleic acid as used herein refers to 
both RNA and DNA 

The test biological sample is conveniently a sample of sinovial fluid, blood, buccal 
scrape, urine or other body fluid or tissue obtained from an individual. 
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The invention lies in the identification of the gene identified herein being linked to IBD 
disease prevalence. Accordingly, in part, the invention is directed to any diagnostic method 
capable of assessing the differential expression level, relative to expression in control tissues, 
of the dlg5 gene identified herein, either alone or as a panel In particular, such methods 
5 include assessment of inRNA transcript levels and/or protein levels. The presence of aberrant 
expression levels of the gene indicating the presence of IBD or an increased likelihood to 
develop the disorder. 

As noted above, in one embodiment the diagnostic/detection methods of the invention 
are employed to detect the presence of one or more SNPs or small insertions, deletions or 

10 duplications of DLG5 oollgS. Suitable SNPs and deletions of DLG5 or dlg5 includeTBose 

identified in Table 3. 

Knowledge of polymorphisms can be of assistance in identifying patients susceptible to 
particular diseases and thoseanost suited to therapy with particular pharmaceutical agents (the 
latter is often termed pharmacogenetics") . Pharmacogenetics can also be used in 

15 pharmaceutical research to assist the drug selection process. Polymorphisms are used in 
mapping the human genome and to elucidate the genetic component of diseases. The reader 
is directed to the following references for background details on pharmacogenetics and other 
uses of polymorphism detection: Linder et al (1997), Clinical Chemistry, 43:254; Marshall 
(1997), Nature Biotechnology. 15:1249; International Patent Application WO 97/40462, 

20 Spectra Biomedical; and-Schafer et al y (1998), Nature Biotechnology. 16:33. 

A haplotype is a set of alleles found at linked polymorphic sites (such as within a gene) 
on a single (paternal or maternal) chromosome. If recombination within the gene is random, 
there may be as many as 2 n haplotypes, where 2 is the number of alleles at each SNP and n is 
the number of SNPs. One approach to identifying mutations or polymorphisms which are 

25 correlated with clinical response is to carry out an association study using all the haplotypes 
that can be identified in the population of interest. The frequency of each haplotype is limited 
by the frequency of its rarest allele, so that nucleotide sequence polymorphisms with low 
frequency alleles are particularly useful as markers of low frequency haplotypes. As 
particular mutations or -polymorphisms.associated with certain clinical features, such as 

30 adverse or abnormal events, are likely to be of low frequency within the population, low 
frequency nucleotide sequence variations may be particularly useful in identifying these 
mutations (for examples see: Linkage disequilibrium at the cystathionine beta synthase (CBS) 
locus and the association between genetic variation at the CBS locus and plasma levels of 
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homocysteine (De Stefano et al, Ann Hum Genet (1998) 62:481-90; and, Keightley et al M 
Blood (1999) 93:4277-83). 

Clinical trials have shown that patient response to treatment with pharmaceuticals is 
often heterogeneous. Thus there is a need for improved approaches to pharmaceutical agent 
5 design and therapy. 

Point mutations in polypeptides will be referred to as follows: natural amino acid (using 
1 or 3 letter nomenclature), position, new amino acid. For (a hypothetical) example "D25K" 
or "Asp25Lys" means that at position 25 an aspartic acid (D) has been changed to lysine (K). 
The presence of a particular nucleic acid base at a polymorphism position will be 
10 represented by the base following the polymorphism position. For (a hypothetical) example, 
the presence of adenine at position 300 will be represented as: 300A. 

We provide examples of nucleotide polymorphisms, including those that affect the 
amino acid sequence of the DLG5 protein. Such amino acid changing polymorphisms are 
indicated in Table 3 as '^on-synonymous". 
15 Nucleotide polymorphisms (mutations) in the promoter and UTR regions may also 

affect the transcription and expression of the dlg5 gene leading to either increased or 
decreased levels of expression or to unregulated activity of the DLG5 protein in vivo. Such 
alterations in the level of expression of the DLG5 protein in vivo may result in a gain or loss 
of function, which is of clinical significance. Recently, it has been reported that even 
20 polymorphisms that do not result in an amino acid change can cause different structural folds 
of mRNA with potentially different biological functions (Shen et al, (1999) Proc Natl Acad 
SciUSA 96:7871-7876). 

In one embodiment of the invention the screening methods described herein utilise a 
DLG5 protein variant which is transcribed from a nucleic acid sequence based on that shown 
25 inSEQIDNO:lor6. 

Nucleotide polymorphisms within dlg5 or DLG5 may also be used as diagnostic 
markers of predisposition to disease. Genotyping nucleotide sequence variants in populations 
suffering from IBD and in control populations not suffering from IBD but matched for factors 
including, but not limited to, racial ancestry, country of origin, sex, age and body mass index 
30 may allow investigators to identify increased risk factors associated with the development of 
IBD disease according to the inheritance of certain SNP genotypes or haplotypes which are 
more prevalent in populations with IBD compared to their incidence in the corresponding 
control populations. This may enable screening for individuals at increased risk of 
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developing IBD by measuring the genotypes and haplotypes of these nucleotide sequence 
polymorphisms within non-symptomatic individuals. We have discovered novel sequence 
polymorphisms in the dlg5 gene which may be useful for the diagnosis of IBD. Public domain 
nucleotide sequence variations, which may also be useful for the diagnosis of IBD, or for 
5 research into IBD, are also identified herein Table 3 lists the sequence polymorphisms in 
dlg5. Those database-derived polymorphisms present in the public domain are annotated as 
either rs- or tsc-. The other SNPs, annotated DLGe, are believed to be identified herein, for 
the first time. Tsc- stands for the SNP consortium. 

A nucleotide sequence vari ation or polvmorhisms could be a single nucleotide 



10 polymorbism, a deletion of one or several nucleotides, a duplication of one or several 

nucleotides or an insertion-of one or several nucleotides in the nucleotide sequence of the 

gene or in sequences modulating the expression of the dlg5 gene. 

According to one-aspect of the present invention there is provided a method for the 

diagnosis of a single nucleotide polymorphism associated with IBD, which method comprises 
15 deteimining from human. nucleic acid, the identity of the nucleotide at position 16 according 

to one or more of SEQTDNOs: 7, 9, 1 1, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 

41, 43, 45, 47, 49, 51,53r55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 

91 and 93; and determining the status of the human by reference to polymorphism(s) detected. 

With respect to the mutation disclosed in SEQ ID NO: 32 and 33, the nucleotide at position 
20 16 will either be C, or in the allele with the 7-base deletion, a G. 

The term human includes both a human having or suspected of having inflammatory 

bowel disease and an asymptomatic human who may be tested for predisposition or 

susceptibility to IBD. At each position the human may be homozygous for an allele or the 

human may be a heterozygote. 
25 In one embodiment of the invention preferably the method for diagnosis described 

herein is one in which the single nucleotide polymorphism located at position 16 (according 

to SEQ ID NO: 7) is the presence of G and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 

herein is cmein which the single nucleotide polymorphism located at position 16(according 
30 to SEQ ID NO: 9) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 

herein is one in which the single nucleotide polymorphism located at position 16 (according 

to SEQ ID NO: 1 1) is the presence of C and/or T. 
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In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 13) is the presence of G and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 15) is the presence of G and/or C. 

Li one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 17) is the presence of C and/or T. 
10 In one embodimentiof the invention preferably the method for diagnosis described 

herein is one in which thegsingle nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 19) is thespresence of A and/or G. 

In one embodimentfof the invention preferably the method for diagnosis described 
herein is one in which the^ingle nucleotide polymorphism located at position 16 (according 
15 to SEQ ID NO: 2 1) is the presence of C and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the- single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 23) is the presence of C and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
20 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 25) ife the presence of G and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 27) is the presence of C and/or T. 
25 In one embodiment of the invention preferably the method for diagnosis described 

herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 29) is the presence of C and/or G. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
30 to SEQ ID NO: 31) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 35) is the presence of G and/or A. 
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In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 37) is the presence of G and/or C. 

In one embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 39) is the presence of G and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 41) is the presence of C and/or T. 
10 In one embodiment of the invention preferably the method for diagnosis described 

herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 43) is the presence of G and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
15 to SEQ ID NO: 45) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 47) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
20 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 49) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 51) is the presence of C and/or T. 
25 In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 53) is the presence of G and/or C. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single-nucleotide polymorphism located- at position 1 6 (according 
30 to SEQ ID NO: 55) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 57) is the presence of G and/or C. 



BEST AVAILABLE COPY 



101073-2 

-19- 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 59) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 61) is the presence of C and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 63) is the presence of T and/or A. 
10 In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 65) is the presence of C and/or G. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
15 to SEQ ID NO: 67) is the presence of G and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 69) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
20 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 71) is the presence of A and/or G (as a result of a single base deletion). 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 73) is the presence of C and/or T. 
25 In one embodiment of the invention preferably the method for diagnosis described 

herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ JD NO: 75) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
30 to SEQ ID NO: 77) is the presence of C and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 79) is the presence of C and/or T. 
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In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 81) is the presence of C and/or T. 

In one embodiment of the invention preferably the method for diagnosis described 
5 herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 83) is the presence of G and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 85) is the presence of C and/or T. 
10 In one embodiment of the invention preferably the method for diagnosis described 

herein is one in which thesingle nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 87) is thejpcesence of A and/or G. 

In one embodimentfbf the invention preferably the method for diagnosis described 
herein is one in which theaingle nucleotide polymorphism located at position 16 (according 
15 to SEQ ID NO: 89) is .thezpresence of G and/or A. 

In one embodimentrof the invention preferably the method for diagnosis described 
herein is one in which the single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 9 1) is the-presence of C and/or G. 

In one embodiments the invention preferably the method for diagnosis described 
20 herein is one in which. the~single nucleotide polymorphism located at position 16 (according 
to SEQ ID NO: 93) is the presence of G and/or A. 

In one embodiment of the invention preferably the method for diagnosis described 
herein is one in which there is the presence or absence of a 7-base deletion located at position 
16 (according to SEQ ID NO: 33). 
25 In another aspect of the invention there is provided a method for the diagnosis of B3D or 

determining susceptibility to develop IBD, which method comprises: 

(i) obtaining a protein or nucleic acid containing sample from an individual; 

(ii) detecting the presence or absence of a variant DLG5 on the basis of the presence of a 
polymorphic amino acid-within the DLG5 protein, or-a polymorphic base within the dlg5 

30 gene sequence; and, 

(iii) determining the status of the human by reference to the presence or absence of a 
polymorphism in DLG5. 




iifcST AVAILABLE COPY 



101073-2 



-21- 

In one embodiment the polymorphic amino acid is located at position 140, 231, 624, 
1067, 1089 or 1481 according to SEQ ED NO: 2. 

In a particular embodiment the polymorphism is selected from the group consisting of: 
Glnl40Arg, Ser321Gly, Glu624Gln, Argl067His, Prol089Leu and Prol481Gln according to 
5 SEQ ID NO: 2. 

In one embodiment the polymorphic amino acid is located at position 30, 121, 514, 957, 
979 or 1371 according to SEQ ID NO: 190. 

In a particular embodiment the polymorphism is selected from the group consisting of: 
Gln30Arg, Serl21Gly , Glu514Gln, Arg957His, Pro979Leu and Prol371Gln according to 
10 SEQ ID NO: 190. 

The protein or nucleicracid containing test sample may conveniently be a sample of 
blood, bronchoalveolarjayage fluid, sputum, or other body fluid or tissue obtained from an 
individual It will be appreciated that the test sample may equally be a nucleic acid sequence 
corresponding to the sequence in the test sample, that is to say that all or a part of the region 

15 in the sample nucleic acid may firstly be amplified using any convenient technique e.g. PCR, 
before use in the analysis of DLG5 variation. 

It will be apparent to.the person skilled in the art that there are a large number of 
analytical proceduresiwhich may be used to detect the presence or absence of variant 
nucleotides at one orimorepolymorphic positions of the invention. In general, the detection 

20 of allelic variation requires a mutation discrimination technique, optionally an amplification 
reaction and optionally a signal generation system List 1 identifies a number of mutation 
detection techniques, some based on the polymerase chain reaction (PCR). These may be 
used in combination with a number of signal generation systems, a selection of which is listed 
in List 2. Further amplification techniques are listed in List 3. Many current methods for the 

25 detection of allelic variation are reviewed by Nollau et al, Clin. Chem 43, 1 1 14-1 120, 1997; 
and in standard textbooks, for example "Laboratory Protocols for Mutation Detection", Ed. by 
U. Landegren, Oxford University Press, 1996 and "PCR", 2 nd Edition by Newton & Graham, 
BIOS Scientific Publishers Limited, 1997. 
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Table 1: Abbreviations: 



ALEX™ 


AmDliflC&tlOn Tft'frflCtOI'V TTI1 1 1" Q ti r\n ©\/cfr^Tn linaor ovfortoiAn 


APEX 


Arraved orimer extendi nn 


ARMS™ 


Amplification rpfractnrv Trmtntirm cvctpm 

i uiA^/iuxwauuii xvuaviui y ll iulqi Kill oy5 ICLUL 


b-DNA 


Branched DNA i 


CMC 


Chemical mismatch clpava&p 


Bp 


base r>air 


COPS 


(""VYtTYnfttltlVP PlllCrf^Tni r*l yvf^nrMTi r» o victim 
VC/ \J ligVJIlu.L'ICAJ 11U.C ui ITT 1 ill p sySLcTH 


DGGE 


DftTlfltTTriTIO' OTPfllPnt ctrI f±ntrr\ nhnrpcic 
iv^uaim m% gicuaicuLL gCl ciCCtiopxioreSlS 


FRET 


Hill OTP ^PPTIPP TPQnnJlTIPP ^rtf^TTrx; fra-ncfer 

x luui vOvwiivv * cat ji idi isjc ciicrgy ix«xxsicr 


LCR 


T ^OflSP pVifnn TPflr»tifYn 


MASDA 


IvTultlTVlP flllplp QtlPf*i'fir* Hi ci nnnotir occov 


NASBA 


Mllplpir fir»lH CP/lllP-nrp "Koc<=»r? o T-vvtVI -J -PI <^ «-»♦•« 
i^u\*idl/ ctuiu. bCLjU-CllUC DaScU aXXipilTlCatlOli 


OLA 


till CTOTin r»1^ a /*\f"iH<^ lirrat -Jrvr* o c c m 7 
w AigvUJllXwldJ UUC 11^ cl LIU 11 doSciy 


PCR 


A wly JLLDCl Oou wJlaJJLl XCdL-LlUIl 


PTT 


Pro t pin lTiiTif*fltir\Ti t#»ct 

i JLVJLC/1JJ. U. UllOclLlUJJ. LCoL 


RFLP 


RftQlriptlf^TI ■frftOTYlPTlf' I^TlOtVi T^/^lxrmr^-rr^Vii o-r*-i 
Awo LA Xv» L1VJJJ. JLl dgiJUOlll lCIlglll XJwly I L JlllSTli 


SDA 


uUuUU V-ilo^Jla.L'CJLJLlCllL dlllL/111 lL-dLlUJLL 


SERRS 


oiuiauc wJJJLlail^/Cvi ldJUUall ICoUIlallCc SpcCiTOSCOpy 


SNP 


Single nucleotide polymorphism 


SSCP 


Single-strand coiiformation polymoiphism analysis 


SSR 


Self sustained replication 


TGGB 


Temperature gradient gel electrophoresis 


3'UTR 


3 ' untranslated region 



List 1 - Mutation Detection Techniques 
General: DNA sequencing, Sequencing by hybridisation 
5 ScanningL.PTT^ SSCP, DGGE, TGGE, Qeayase, Heteioduplex analysis, CMC, Enzymatic 
mismatch cleavage 

* Note: not useful for detection of promoter polymorphisms. 

Hybridisation Based: Solid phase hybridisation: Dot blots, MASDA, Reverse dot blots, 
Oligonucleotide arrays (DNA Chips) 
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Solution phase hybridisation: Taqman™ - US-5210015 & US-5487972 (Hoffmann-La 
Roche), Molecular Beacons - Tyagi et al (1996), Nature Biotechnology, 14, 303; WO 
95/13399 (Public Health Inst., New York) 

Extension Based: ARMS™-allele specific amplification (as described in European patent 
5 No. EP-B-332435 and US patent No. 5,595,890), ALEX™ - European Patent No. BP 332435 
Bl (Zeneca Limited), COPS - Gibbs et al (1989), Nucleic Acids Research, 17, 2347. 
Incorporation Based: Mini-sequencing, APEX 
Restriction Enzyme Based: RFLP, Restriction site generating PCR 
Ligation Based: OLA 
10 Other: Invader assay, Hybridisation protection assay 

List 2 - Si gnal Generation or Detection Systems 

Fluorescence: FRET* Fluorescence quenching, Fluorescence polarisation - United Kingdom 
Patent No. 2228998 (Zeneca Limited) 
15 Other: Chemiluminescence, Electrochemiluminescence, Raman, Radioactivity, Colorimetric, 
Mass spectrometry, SERRS - WO 97/05280 (University of Strathclyde). 

List 3 - Further Amnlifiration Methods 

SSR, NASBA, LCR.LSDA, b-DNA 
20 Preferred mutation detection techniques include ARMS™-allele specific amplification, 

Taqman™, Mini sequencing, sequencing, RFLP, ALEX™, OLA, restriction site based PCR 

and FRET techniques. 

Particularly preferred methods include ARMS™-allele specific amplification, OLA and 

RFLP based methods. ARMS™-allele specific amplification is an especially preferred 
25 method. 

ARMS™-allele specific amplification (described in European patent No. EP-B-332435, 
US patent No. 5,595,890 and Newton et al. (Nucleic Acids Research, VoL 17, p.2503; 1989)), 
relies on the complementarity of the 3' terminal nucleotide of the primer and its template. 
The 3' terminal nucleotide of the primer being either complementary or non-complementary 
30 to the specific mutation, allele or polymorphism to be detected. There is a selective advantage 
for primer extension from the primer whose 3' terminal nucleotide complements the base 
mutation, allele or polymorphism Those primers which have a 3' terminal mismatch with the 
template sequence severely inhibit or prevent enzymatic primer extension Polymerase chain 
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reaction or unidirectional primer extension reactions therefore result in product amplification 
when the 3' terminal nucleotide of the primer complements that of the template, but not, or at 
least not efficiently, when the 3' terminal nucleotide does not complement that of the 
template. 

5 It will be appreciated that the test sample may equally be a nucleic acid sequence 

corresponding to the sequence in the test sample, that is to say that all or a part of the region 
in the sample nucleic acid may firstly be amplified using any convenient technique e.g. 
polymerase chain reacticm (PGR), before analysis. The nucleic acid may be genomic DNA or 
fractionated or whole cell RNA. In one emb odiment the RNA is whole cell RNA and is used 
10 directly as the template for labelling a first strand cDNA using random primers or poly A 
primers. The nucleic acid or protein in the test sample may be extracted from the sample 
according to standard methodologies (Sambrook et al. 'Molecular Cloning- A Laboratory 
manual", second edition. Cold Spring Harbor, NY (1989)). 

It will be apparent that the gene sequence disclosed for dlg5 (as depicted in SEQ ID NO: 
15 1) is a representative sequence. In normal individuals there are two copies of each gene, a 
maternal and paternal copy, which will likely have some sequence differences, moreover 
within a population there will exist numerous allelic variants of the gene sequence, indeed the 
Examples identify numerous SNPs and other mutations within dlg5 gene that represent allelic 
variants within the population. It will be appreciated that the diagnostic methods and other 
20 aspects of this invention extend to the detection etc. of any of these sequence variants. 
Preferred sequence variants are those that possess at least 90% and preferably at least 95% 
sequence identity (nucleic acid or amino acid) to DLG5 depicted in SEQ ID No. 1 or 2. 
Nucleic acid sequence identity can also be gauged by hybridisation studies whereby, under 
stringent hybridisation and wash conditions, only closely related sequences (for example, 
25 those with >90% identity) are capable of forming a hybridisation complex. 

In a further aspect, the diagnostic methods of the invention, are used to assess the 
predisposition and/or susceptibility of an individual to IBD, and the present invention may be 
used to recognise individuals who are particularly at risk from developing IBD conditions. 
' * In a further aspect, the diagnostic methods of the invention are-used in the 
30 development of new drug therapies, which selectively target one or more allelic variants 
identified herein. Identification of a link between a particular allelic variant and 
predisposition to disease development or response to drug therapy may have a significant 
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impact on the design of new drugs. Drugs may be designed to regulate the biological activity 
of variants implicated in the disease process whilst minimising effects on other variants. 

In a further diagnostic aspect of the invention the presence or absence of variant 
nucleotides is detected by reference to the loss or gain of, optionally engineered, sites 
5 recognised by restriction enzymes. The person of ordinary skill will be able to design and 
implement diagnostic procedures based on the detection of restriction fragment length 
polymorphism due to the loss or gain of one or more of the sites. 

The invention further provides nucleotide sequence information, which can be used to 
design assays for detection of the polymorphisms of the invention. 
10 The invention further provides nucleotide primers, which detect the polymorphisms of 

the invention. 

The invention further provides nucleotide probes, which can detect the polymorphisms 
of the invention. 

The amino acid sequence method for diagnosis is preferably one which is determined by 

15 immunological methods such as enzyme linked immunosorbent assay (ELIS A). 

The levels of the DLG5 can be assessed from relative amounts of iriRNA, cDNA, 

* - genomic DNA or polypeptide sequence present in the test sample. Where RNA is used, it 
may be desired to convert the RNA to a complementary cDNA and during this process it may 
be desirable to incorporate a suitable detectable label into the cDNA. 

20 In a preferred embodiment the method of the invention relies on detection of mRNA 
transcript levels. This involves assessment of the relative mRNA transcript levels of dlg5 in a 
sample, and comparison of sample data to control data. The gene transcript can be detected 
individually, or, is preferably detected amongst a panel of other disease-linked gene dlg5 from 
which a transcript profile can be generated Levels of dlg5 mRNA in the test sample can be 

25 detected by any technique known in the art. These include Northern blot analysis, reverse 
transcriptase-PCR amplification (RT-PCR), microarray analysis and RNAse protection. 
In one embodiment, levels of dlg5 RNA in a sample can be measured in a Northern blot 
assay. Here, tissue RNA is fractionated by electrophoresis, fixed to a solid membrane 
support, such as nitrocellulose or nylon, and hybridised to a probe or probes capable of 

30 selectively hybridising with the dlg5 RNA to be detected. The actual levels may be 

quantitated by reference to one or more control housekeeping genes. Probes may be used 
singly or in combination. This may also provide information on the size of mRNA detected 
by the probe. Housekeeping genes are genes which are involved in the general metabolism or 
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maintenance of the cell, and are considered to be expressed at a constant level irrespective of 
cell type, physiological state or stage in the cell cycle. Examples of suitable housekeeping 
genes are: beta actin, GAPDH, histone H3.3 or ribosomal protein L13 (Koehler et aL, 
Quantitation of mRNA by Polymerase Chain Reaction. Springer- Verlag, Germany (1995)). 
5 To gauge relative expression levels, a control sample can be run alongside the test sample or, 
the test result/value can be compared to dlg5 expression levels expected in a normal or control 
tissue. These control values can be generated from prior test experiments using normal or 
control tissues, to generate mean or normal range values for dlg5. 

In another e mbodiment, the dlg5 nucleic acid in a tissue sample is amplified and 

10 quantitatively assayed. The polymerase chain reaction (PCR) procedure can be used to 
amplify specific nucleic acid sequences through a series of iterative steps including 
denaturation, annealing jofcoligonucleo tide primers (designed according to the sequence 
disclosed in SEQ ID NO&l)^ and extension of the primers with DNA polymerase (see, for 
example, Mullis, et aL, UtS,< patent No. 4,683,202; Loh et aL, Science 243:217 (1988)). In 
15 reverse txanscriptase-PCR^RT-PCR) this procedure is preceded by a reverse transcription step 
to allow a large amplification of the number of copies of mRNA (Koehler et aL, supra). 
Other known nucleic acid r amplification procedures include transcription-based amplification 
systems (TAS) such as nucleic acid based sequence application (NASB A) and 3SR (Kwoh et 
aL, Proc NatL Acad SciUSA 86:1173 (1989), Gingeras et aL, PCT application WO 
20 88/10315), the ligase chain reaction (LCR, see European Application No. 320308), Strand 
Displacement Amplification (SDA), ''race", "one sided PCR" and others (Frohman, PCR 
Protocols: a Guide to Methods and Applications. Academic Press, NY (1990); Ohara et aL, 
Proc Natl Acad Scl USA 86:5673-5677 (1989)). Quantitation of RT-PCR products can be 
done while the reaction products are building up exponentially, and can generate 
25 diagnostically useful clinical data. In one embodiment, analysis is carried out by reference to 
one or more housekeeping genes which are also amplified by RT-PCR. Quantitation of RT- 
PCR product may be undertaken, for example, by gel electrophoresis visual inspection or 
image analysis, HPLC (Koehler et aL, supra) or by use of fluorescent detection methods such 
as intercalation labelling-Taqman probe (Higuchi et-al.„ .Biotechnology 10:413-417 (1992)), 
30 Molecular Beacon (Piatek et aL, Nature BiotechnoL 4:359-363 (1998)), primer or Scorpion 
primer (Whitcombe et aL, Nature Biotech 17:804-807 (1999)); or other fluorescence detection 
method, relative to a control housekeeping gene or genes as discussed above. 
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Dlg5 RNA measurements can also be carried out on sinovial fluid, blood or serum 
samples. Preferably, the RNA is obtained from a peripheral blood sample. In the case of 
soluble RNA in the blood serum, the low abundance of mRNA expected would necessitate a 
sensitive test such as RT-PCR (Kopreski et al., Clin Cancer Res 5: 1961-5 (1999)). A whole 
5 blood gradient may be performed to isolate nucleated cells and total RNA is extracted such as 
by the Rnazole B method (Tel-Test Inc., Friendsworth, Tex.) or by modification of methods 
known in the art such as described in Sambrook et aL, {supra). 

In a preferred embodiment of the invention, the diagnosis/detection method of the 
invention involves assessing dlg5 transcript levels using microarray analysis. Microarray 
10 technology makes it possible to simultaneously study the expression of many thousands of 
genes in a single experiment. Analysis of gene expression in human tissue (e.g. biopsy tissue) 
can assist in the diagnosissmd prognosis of disease and the evaluation of risk for disease. A 
comparison of levels oftexpression of various genes from patients with defined pathological 
disease conditions with normal patients enables an expression profile, characteristic of 
15 disease, to be created. 

Probes are made that selectively hybridise to the sequences of the target dlg5 gene in the 
test sample. These probes, perhaps together with other probes and control probes, are bound 
at discrete locations on. a. suitable support medium such as a nylon filter or microscope slide to 
form a transcript profiling-array. The diagnostic method involves assessing the relative 
20 mRNA transcript level of dlg5 in a clinical sample. This can be done by radioactively 

labelling, or non-radioactively labelling the tissue mRNA, which can be optionally purified 
from total RNA, in any of a number of ways well known to the art (Sambrook et al., supra). 
The probes can be directed to any part or all of the target dlg5 mRNA. 

In another embodiment of the invention, total dlg5 RNA or DNA is quantified and 
25 compared to levels in control tissue or expected levels from pre tested standards. DNA and/or 
RNA may be quantified using techniques well known in the art. Messenger RNA is often 
quantitated by reference to internal control mRNA levels within the sample, often relative to 
housekeeping genes (Koehler et al., supra). 

In a preferred embodiment hybridisation signals generated are measured by corcputer 
30 software analysis of images on phosphorimage screens exposed to radioactively labelled 
tissue RNA hybridised to a microarray of probes on a solid support such as a nylon 
membrane. In another, quantities are measured by densitometry measurements of radiation- 
sensitive film (e.g. X-ray film), or estimated by visual means. In another embodiment 
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quantities are measured by use of fluorescently labelled probe, which may be a mixture of 
biopsy and normal RNA differentially labelled with different fluorophores, allowing 
quantities of dlg5 mRNA to be expressed as a ratio versus the normal level. The solid support 
in this type of experiment is generally a glass microscope slide, and detection is by 
5 fluorescence microscopy and computer imaging. 

The detection of specific interactions may be performed by detecting the positions 
where the labelled target sequences are attached to the array. Radiolabeled probes can be 
detected using conventional autoradiography techniques. Use of scanning autoradiography 

witfa * digitised scanner and suitable software for an alysing the results « prefeged Bftsffi the 

10 label is a fluorescent label, the apparatus described, e.g. in International Publication No. WO 
90/15070; US Patent No. 5, 143,854 or US Patent No. 5,744,305 may be advantageously 
applied. Indeed, most array formats use fluorescent readouts to detect labelled capturertarget 
duplex formation Laser confocal fluorescence microscopy is another technique routinely in 
use (Kozal et aL, Nature Medicine 2:753-759 (1996)). Mass spectrometry may also be used 
15 to detect oligonucleotides bound to a DNA array (Little et al, Analytical Chemistry 69: 4540- 
4546, (1997)). Whatever the reporter system used, sophisticated gadgetry and software may 
- be required in order to interpret large numbers of readouts into meaningful data (such as 
described, for example, in US Patent No. 5,800,992 or International Publication No. WO 
90/04652). 

~ 20 In a preferred embodiment of the microarray test, the dlg5 RNA measurement is 

generated as a value relative to an internal standard (i.e. a housekeeping gene) known to be 
constant or relatively constant. The histone H3.3 and ribosomal protein L19 housekeeping 
genes have been shown to be cell-cycle independent and constitutively expressed in all tissues 
(Koehler et aL , supra). For normalisation of data, several different housekeeping genes can 
25 be used to generate an average housekeeping measurement. 

A microarray or RT-PCR test to detect IBD or susceptibility thereto can be used where 
tissue samples containing mRNA are available. 

Samples for RNA extraction must be treated promptly to avoid RNA degradation 
• -(Sambrooket-alT^rd). This entails either prompt extraction using e.g. phenotbaled- * 
30 reagents or snap freezing in e.g. liquid nitrogen Samples can be stored at -70°C or less until 
RNA can be extracted at a later date. Proprietary reagents are available which allow tissue or 
cells to be conveniently stored for several days at room temperature and up to several months 
at 4<>C (e.g. RNAlater, Ambion Inc., TX ). Prior to extraction, methods such as grinding, 
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blending or homogenisation are used to dissipate the tissue in a suitable extraction buffer. 
Typical protocols then use solvent extraction and selective precipitation techniques. 
In another embodiment oligonucleotide probe(s) capable of selectively hybridising to dlg5 
nucleic acid, can be used to detect levels of dlg5 gene expression. 
5 Convenient DNA sequences for use in the various aspects of the invention may be 

obtained using conventional molecular biology procedures, for example by probing a human 
genomic or cDNA library with one or more labelled oligonucleotide probes containing 10 or 
more contiguous nucleotides designed rising the nucleotide sequences described here. 
Alternatively, pairs of oligonucleotides one of which is homologous to the sense strand and 
10 one to the antisense strand, designed using the nucleotide sequences described herein to flank 
a specific region of DNA may be used to amplify that DNA from a cDNA library. 

Levels of dlg5gene expression can also be detected by screening for levels of 
polypeptide (DLG5 protein). For example, monoclonal antibodies immunoreactive with 
DLG5 protein can be used to screen a test sample. Such immunological assays can be done in 
15 any convenient format known in the art These include Western blots, immunohistocheiiiical 

assays and ELIS A assays. Functional assays can also be used, such as protein binding 
~ determinations. 

According to another aspect of the present invention, there is provided an allele specific 
* primers or probes capable of detecting a polymorphism at position 16 in one or more of SEQ 
20 ID NOs: 7, 9, 11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 
53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 75, 77, 79, 81, 83, 85, 87, 89, 91 and 93. The person 
of ordinary skill in the art will be able to design suitable primers or probes using the sequence 
information provided herein. 

An allele specific primer is used, generally together with a constant primer, in an 
25 amplification reaction such as a PGR reaction, which provides the discrimination between 
alleles through selective amplification of one allele at a particular sequence position e.g. as 
used for ARMS™ assays. The allele specific primer is preferably 17- 50 nucleotides, more 
preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

An allele specific primer preferably corresponds exactly with the allele to be detected 
30 but derivatives thereof are also contemplated wherein about 6-8 of the nucleotides at the 3* 
terminus correspond with the allele to be detected and wherein up to 10, such as up to 8, 6, 4, 
2, or 1 of the remaining nucleotides may be varied without significantly affecting the 
properties of the primer. 
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Preferred primers for amplification are between 15 and 60 bases, more preferably 
between 17 and 35bases in length. Probe sequences can be anything from about 25 
nucleotides in length upwards. If the target sequence is a gene of 2kb in size the probe 
sequence can be the complete gene sequence complement and thus may also be 2kb in size 
5 Preferably, the probe sequence is a genomic, or more preferably a cDNA, fragment of the 
target sequence and may be between 50 and 2000 bases, preferably between 200 and 750 
bases. It will be appreciated that multiple probes each capable of selectively hybridising to a 
different target sequence of the dlg5 nucleic acid, maybe across the complete length of the 

dl&5 gePe SeqUenCe> "»* be ggPgg *** together in a diagnostic test. The primer, or 

10 probes may be completely homologous to the target sequence or may contain one or more 
mismatches to assist specificity in binding to the correct template sequence. Any sequence 
which is capable of selectively hybridising to the target sequence of interest, may be used as a 
suitable primer or probe^equence. It will also be appreciated that the probe or primer 
sequences must hybridise to the target template nucleic acid. If the target nucleic acid is 
15 double stranded (genomic or cDNA) then the probe or primer sequence can hybridise to the 
sense or antisense strand.. .If however the target is mRNA (single stranded sense strand) the 
pnmer/probe sequence will have to be the antisense complement. 

An example of a suitable hybridisation solution when a nucleic acid is immobilised on a 
nylon membrane and,the probe nucleic acid is greater than 500 bases or base pairs is- 6 x SSC 
20 (saline sodium citrate), 0.5% SDS (sodium dodecyl sulphate), 100^g/ml denatured, sonicated 
salmon sperm DNA. :The hybridisation being performed at 68°C for at least 1 hour and the 
filters then washed at 68°C in 1 x SSC, or for higher stringency, 0. 1 x SSC/0. 1% SDS. 

An example of a suitable hybridisation solution when a nucleic acid is immobilised on a 
nylon membrane and the probe is an oligonucleotide of between 12 and 50 bases is- 3M 
25 trimethylammonium chloride (TMAC1), 0.01M sodium phosphate (pH 6.8), ImM EDTA (pH 
7.6) , 0.5% SDS,100ng/ml denatured, sonicated salmon spermDNA and 0.1 dried skimmed 
milk. The optimal hybridisation temperature (Tm) is usually chosen to be 5°C below the Ti 
_ of the hybrid chain. Ti is the irreversible melting temperature of the hybrid formed between 
the probe and its target. If there are any mismatches between the prob~e and the target the Tm 
30 will be lower. As a general guide, the recommended hybridisation temperature for 17-mersin 
3M TMAC1 is 48-50°C; for 19-mers,it is 55-57°C; and for 20-mers, it is 58-66°C. 

According to another aspect of the present invention there is provided an allele-speciflc 
oligonucleotide probe capable of detecting a polymorphism in human nucleic acid 
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corresponding to that at position 16 of any of SEQID NOs: 7, 9, 11, 13, 15, 17, 19, 21 23 
25, 27, 29, 31, 33, 35, 37, 39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 
75, 77, 79, 81, 83, 85, 87, 89, 91 and 93. 

The allele-specific oligonucleotide probe is preferably 17- 50 nucleotides, more 
5 preferably about 17-35 nucleotides, more preferably about 17-30 nucleotides. 

The design of such probes will be apparent to the molecular biologist of ordinary skill. 
Such probes are of any convenient length such as up to 50 bases, up to 40 bases, more 
conveniently up to 30 bases in length, such as for example 8-25 or 8-15 bases in length. In 
general such probes will comprise base sequences entirely complementary to the 
10 corresponding wild type or variant locus in the gene. However, if required one or more 

mismatches may be introduced, provided that the discriminatory power of the oligonucleotide 
probe is not unduly affected. The probes of the invention may carry one or more labels to 
facilitate detection. The sequences disclosed as SEQ ID Nos: 6-93, when in single stranded 
form, are representative examples of allele specific probes capable of detecting one or other of 
15 the polymorphic variants ofdlgS. Each of these sequences is fully complementary to the native 
dlg5 gene and one or other of the particular allelic variants. 

Primers or probes for use in any of the methods of the invention may be manufactured 
using any convenient method of synthesis. Examples of such methods may be found in 
standard textbooks, for example "Protocols for Oligonucleotides and Analogues; Synthesis 
20 and Properties," Methods in Molecular Biology Series; Volume 20; Ed. Sudhir Agrawal 
Humana ISBN: 0-89603-247-7 (1993); 1 st Edition. If required the primer(s) may be labelled 
to facilitate detection. 

There are many conventional detectable labels such as radioisotopes, fluorescent labels, 
cheiniluminescent compounds, labelled binding proteins, magnetic labels, spectroscopic 

25 markers and linked enzymes that might be used in conjunction with the primers or probes of 
the invention. One particular example well known in the art is end-labelling with 3 ¥. 
Fluorescent labels are preferred because they are less hazardous than radiolabels, they provide 
a strong signal with low background and various different fluorophors capable of absorbing 
light at different wavelengths and/or giving off different colour signals exist to enable 

30 comparative analysis in the same analysis. For example, fluorescein gives off a green colour, 
rhodamine gives off a red colour and both together give off a yellow colour. 

The oligonucleotide primers and probes of the invention are particularly suitable for 
detecting the genotype of a particular SNP of dlg5. 
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The DLG5 protein of the invention and homologues or fragments thereof may he used 
to generate substances which selectively bind to it and in so doing regulate the activity of the 
protein Such substances include, for example, antibodies, and the invention extends in 
particular to an antibody which is capable of binding to the protein shown in SEQ ID No:2. 

5 In particular the antibody may be a neutralising antibody. 

As used herein the term antibody is to be understood to mean a whole antibody or a 
fragment thereof, for example a F(ab)2, Fab, FV, VH or VK fragment, a single chain 
antibody, a multimeric monospecific antibody or fragment thereof, or a bi- or multi-specific 
antibody or fragment thereof. Each of these ty pes of antibody derivative and their acronyms 

10 are well known to the person skilled in the art. ~ ~ 

In another preferred. embodiment antibodies directed against DLG5 protein can be used, 
to detect, prognose, diagnose and stage IBD. Various histological staining methods known in 
the art, including immunochemical staining methods, may also be used. Silver stain is but 
one method of detecting DLG5 proteins. For other staining methods useful in the present 

15 invention see, for example, A Textbook of Histology, Eds. Bloom and Fawcett, W.B. 
Saunders Co., Philadelphia (1964). 

According to a further aspect of the invention there is provided use of an antibody 
selective for DLG5 protein, in an assay to diagnose or prognose or monitor IBD. 

The antibodies for use in this aspect of the invention can be prepared using the DLG5 

20 protein/polypeptides. 

Methods of making and detecting labelled antibodies are well known (Campbell; 
Monoclonal Antibody Technology, in: Laboratory Techniques in Biochemistry and Molecular 
Biology, Volume 13. Eds: Burdon R et al Elsevier, Amsterdam (1984)). The term antibody 
includes both monoclonal antibodies, which are a substantially homogeneous population, and 

25 polyclonal antibodies which are heterogeneous populations. The term also includes inter aha, 
humanised and chimeric antibodies. Monoclonal antibodies to specific antigens may be 
obtained by methods known to those skilled in the art, such as from hybridoma cells, phage 
display libraries or other methods. Monoclonal antibodies may be inter alia, human, rat or 
mouse derived.-For the-production of-human monoclonal-antibodies, nybridoma. cells may be . 

30 prepared by fusing spleen cells from an immunised animal, e.g. a mouse, with a tumour cell. 
Appropriately secreting hybridoma cells may thereafter be selected (Koehler & Milstein, 
Nature 256:495-497 (1975); Cole et al., "Monoclonal antibodies and Cancer Therapy", Alan 
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R Liss Lie, New York N. Y. pp 77-96 (1985)). Such antibodies may be of any 
immunoglobulin class including IgG, IgM, IgE, IgA, IgD and any subclass thereof. 
Polyclonal antibodies can be generated by immunisation of an animal (such as a mouse, rat, 
goat, horse, sheep etc) with an antigen, such as a DLG5 polypeptide. 
5 The DLG5 polypeptides) can be prepared by various techniques known to the person 

skilled in the art. RNA transcripts can be used to prepare a polypeptide of the invention by in 
vitro translation techniques according to known methods (Sambrook et al supra). 

Alternatively, the DLG5 polypeptide(s) can be synthesised chemically. For example, by 
the Merryfield technique (J. Amer. Chem. Soc. 85:2149-2154, (1968)). Numerous automated 

10 polypeptide synthesisers>-such as Applied Biosystems 431 A Peptide Synthesizer also now 
exist. Alternatively, and^preferably, the DLG5 polypeptide(s) are produced from a nucleotide 
sequence encoding the.[polypeptide using recombinant expression technology. A variety of 
expression vector/host systems may be used to express the dlgS coding sequences. These 
include, but are not Mmited to microorganisms such as bacteria expressed with plasmids, 

15 cosmids or bacteriophage;-yeasts transformed with expression vectors; insect cell systems 
transfected withbaculovirus expression systems; plant cell systems transfected with plant 
virus expression systems, such as cauliflower mosaic virus; or mammalian cell systems (for 
example those transfected with adenoviral vectors); selection of the most appropriate system 
is a matter of choice.; Preferably, the DLG5 protein is expressed in eukaryotic cells, 

20 especially mammalian, insect and yeast cells. Mammalian cells provide post-translational 
modifications to recombinant DLG5 protein, which include folding and/or phosphorylation. 

Expression vectors usually include an origin of replication, a promoter, a translation 
initiation site, optionally a signal peptide, a polyadenylation site, and a transcription 
termination site. These vectors also usually contain one or more antibiotic resistance marker 

25 gene(s) for selection. As noted above, suitable expression vectors may be plasmids, cosmids 
or viruses such as phage or retroviruses. The coding sequence of the polypeptide is placed 
under the control of an appropriate promoter, control elements and transcription terminator so 
that the nucleic acid sequence encoding the polypeptide is transcribed into RNA in the host 
cell transformed or transfected by the expression vector construct The coding sequence may 

30 or may not contain a signal peptide or leader sequence for secretion of the polypeptide out of 
the host cell Expression and purification of the DLG5 polypeptide(s) can be easily 
performed using methods well known in the art (for example as described in Sambrook et aL 
supra). 
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The DLG5 polypeptide(s) so produced can then be used to inoculate animals, from 
which serum samples, containing the specific antibody against the introduced DLG5 
protein/polypeptide, can later be obtained. 



Rodent antibodies may be humanised using recombinant DNA technology according to 

5 techniques known in the art. Alternatively, chimeric antibodies, single chain antibodies, Fab 
fragments may also be developed against the polypeptides of the invention (Huse et al., 
Science 256:1275-1281 (1989)), using skills known in the art. Antibodies so produced have a 
number of uses, which will be evident to the molecular biologist or immunologist skilled in 
the art. Such uses include, but are not limited to, monitoring enzyme expression, 

10 development of assays to measure enzyme activity and use as a therapeutic agent. Enzyme — 
linked immunosorbant assays (ELISAs) are well known in the art and would be particularly 
suitable for detecting the DLG5 protein or polypeptide fragments thereof in a test sample. 

The DLG5 specific antibodies can be used in an ELISA assay to detect DLG5 protein in 
body fluids or by irnmunobistochernistry or other means. In addition, an antibody could be 

15 used individually or as part of a panel of antibodies, together with a control antibody, which 
reacts to a common protein, on a dipstick or similar diagnostic device. 

All the essential materials and reagents required for detecting DLG5 in a test sample 
may be assembled together in a kit. Such a kit may comprise one or more diagnostic cDNA 
probes or oligonucleotide primers together with control probes/primers. The kit may contain 

20 probes immobilised on a microarray substrate such as a filter membrane or silicon-based 
substrate. The kit may also comprise samples of total RNA derived from tissues of various 
physiological states, such as normal, BPH, confined tumour and metastatic tumour, for 
example, to be used as controls. The kit may also comprise appropriate packaging and 
instructions for use in the methods of the invention. 

25 According to another aspect of the present invention there is provided a diagnostic kit 
for diagnosing or prognosing or monitoring BBD comprising, one or more diagnostic probe(s) 
and/or diagnostic primer(s) and/or antibodies capable of selectively hybridising or binding to 
DLG5. 

— .It-will be.appreciated that the term, "diagnostic kit" is not intended to limkthe kit to 
30 diagnostic use only, it also encompasses other uses such as in prognostic, stage monitoring 
and therapeutic efficacy studies. 

In a preferred embodiment, the diagnostic (detection) probes are provided on a 
microarray. 
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Such kits may further comprise appropriate buffer(s) and/or polymerase(s) such as 
thermostable polymerases, for example taq polymerase. They may also comprise 
companion/constant primers and/or control primers or probes. A companion/constant primer 
is one that is part of the pair of primers used to perform PCR. Such primer usually 
5 complements the template strand precisely. The kit may also contain control normal RNA 
labelled with one fluorophore (E.g.. Cy5). In use, patient RNA derived from biopsy or body 
fluids or cells can be labelled with another fluorophore (e.g. Cy3), the RNAs could then be 
mixed and hybridised to the array. Instrumentation to detect fluorescence ratio e.g. of. 
Cy3:Cy5 are available and could be used to detect DLG5 over-expression. 
10 In another embodiment the kit comprises one or more specific probes suitable for 

hybridisation to mRNA in tissue sections in situ. The kit may also contain hybridisation 
buffer and detection reagents for colourimetric or fluorescence microscopy detection. 
In another embodiment the kit comprises a set of specific oligonucleotide primers, optionally 
labelled, for quantitation by RT-PCR of dlgSmRNA. These primers may be Scorpion primers 
- 15 (Whitcombe et al, Nature Biotechnol. 17:804-807, 1999) allowing accurate quantitation of 
n specific PCR product Alternatively, Taqman or Molecular Beacon probes may be provided 
= in the kit for this purpose. One form of the kit would be a microtitre plate containing specific 
■ » reagents in several wells, to which aliquots of extracted RNA could be pipetted. The 

microtitre plate could be thermocycled on a suitable machine, which could also be capable of 
" 20 reading fluorescence emissions from plate wells (e.g. Perkin Blmer 7700). 

In another embodiment the kit comprises one or more antibodies specific for the DLG5 
protein for use in immunohistochemical analysis. 

In another embodiment the kit is an ELISAkit comprising one or more antibodies 
specific for the DLG5 protein identified herein. 
25 In another aspect of the invention there is provided a method for treating a patient 

suffering from DBD comprising administering to the patient an effective amount of an 
antibody specific for DLG5. 

According to another aspect of the invention, the dlg5 gene may be used in gene 
therapy, for example where it is desired to modify the production of the protein in vivo, and 
30 the invention extends to such uses. 

Knowledge of the gene according to the invention also provides the ability to regulate 
its expression in vivo by for example the use of antisense DNA or RNA. One therapeutic 
means of inhibiting or dampening the expression levels of a particular gene (for example dlg5 
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identified herein) is to use antisense therapy. Antisense therapy utilises antisense nucleic acid 
molecules that are synthetic segments of DNA or RNA ("oligonucleotides"), designed to 
mirror specific mRNA sequences and block protein production. Once formed, the mRNA 
binds to a ribosome, the cell's protein production "factory" which effectively reads the RNA 
5 sequence and manufactures the specific protein molecule dictated by the gene. If an antisense 
molecule is delivered to the cell (for example as native oligonucleotide or via a suitable 
antisense expression vector), it binds to the messenger RNA because its sequence is designed 
to be a complement of the target sequence of bases. Once the two strands bind, the mRNA 

C3D °° lQDger dictate ft* manufacture of encoded protein by the ribosome and is ra pidly 

10 broken down by the cell's enzymes, thereby freeing the antisense oligonucleotide to seek and 
disable another identicalsnessenger strand of mRNA. 

Thus, according tosmother aspect of the invention there is provided a method for 
treating a patient sufferingifrom IBD comprising administering to said patient an effective 
amount of an antisense molecule capable of binding to the mRNA of the dlg5 gene, and 
15 inhibiting expression of the protein product of the dlg5 gene. 

Complete inhibition, of protein production is not essential, indeed may be detrimental. It 
is likely, that inhibition to a state similar to that in normal tissues would be desired. 

This aspect of antisense therapy is particularly applicable if the IBD disorder is a direct 
cause of over-expression of the dlg5 gene in question, although it is equally applicable if said 
20 dlg 5 gene indirectly -cause the IBD disorder. With knowledge of the dlg5 gene and mRNA 
sequence, the person, skilled in the art is able to design suitable antisense nucleic acid 
therapeutic molecules and administer them as required. 

Antisense oligonucleotide molecules with therapeutic potential can be determined 
experimentally using well established techniques. To enable methods of down-regulating 
25 expression of the dlg5 gene of the present invention in mammaUan cells, an example antisense 
expression construct can be readily constructed for instance using the pREPIO vector 
(Invitrogen Corporation). Transcripts are expected to inhibit translation of the gene in cells 
transfected with this type of construct. Antisense transcripts are effective for inhibiting 
— translation of thenative gene transcript, and capable of inducing-the-effects (e.g., regulation of 
30 tissue physiology) herein described. Oligonucleotides which are complementary to and 
hybridisable with any portion of dlg5 gene mRNA are contemplated for therapeutic use U S 
Patent No. 5,639,595, 'Identification ofNovelDrugs and Reagents", issued Jun. 17, 1997, 
wherein methods of identifying oligonucleotide sequences that display in vivo activity are' 
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thoroughly described, is herein incorporated by reference. Expression vectors containing 
random oligonucleotide sequences derived from the dlg5 gene sequence are transformed into 
cells. The cells are then assayed for a phenotype resulting from the desired activity of the 
oligonucleotide. Once cells with the desired phenotype have been identified, the sequence of 

5 the oligonucleotide having the desired activity can be identified. Identification may be 
accomplished by recovering the vector or by polymerase chain reaction (PCR) amplification 
and sequencing the region containing the inserted nucleic acid material Antisense molecules 
can be synthesised for antisense therapy. These antisense molecules may be DNA, stable 
derivatives of DNA such as phosphorothioates ormethylphosphonates, RNA, stable 

10 derivatives of RNA such-as 2'-0-alkylRNA, or other oligonucleotide mimetics. U.S. Patent 
No. 5,652,355, "Hybridphgonucleotide Phosphorothioates", issued July 29, 1997, and U.S. 
Patent No. 5,652,356, "Inverted Chimeric and Hybrid Oligonucleotides", issued July 29, 
1997, which describe the^synthesis and effect of physiologically-stable antisense molecules, 
are incorporated by reference. Antisense molecules may be introduced into cells by 

15 microinjection, hposome..encapsulation or by expression from vectors harboring the antisense 
sequence. '; 

As noted above; antisense nucleic acid molecules may also be provided as RNAs, as 
some stable forms offRN A are now known in the art with a long half-life that may be 
administered directly? Without the use of a vector. In addition, DNA constructs may be 
20 delivered to cells by liposomes, receptor mediated transfection and other methods known to 
the art. 

The antisense DNA or RNA for co-operation with the gene in SEQ ID No: 1 can be 
produced using conventional means, by standard molecular biology and/or by chemical 
synthesis as described above. If desired, the antisense DNA or antisense RNA may be 
25 chemically modified so as to prevent degradation in vivo or to facilitate passage through a cell 
membrane and/or a substance capable of inactivating mRNA, for example ribozyme, may be 
linked thereto and the invention extends to such constructs. 

The antisense DNA or antisense RNA may be of use in the treatment of diseases or 
disorders in humans in which the over- or under-regulated production of the dlg5 gene 
30 product has been implicated. 

Alternatively, ribozyme molecules may be designed to cleave and destroy the dlg5 
mRNA in vivo. Ribozymes are RNA molecules that possess highly specific endoribonuclease 
activity. Hammerhead ribozymes comprise a hybridising region, which is complementary in 
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nucleotide sequence to at least part of the target RNA, and a catalytic region, which is adapted 
to recognise and cleave the target RNA The hybridising region preferably contains at least 9 
nucleotides. The design, construction and use of such ribozymes is well known in the art and 
is more fully described in Haselhoff and Gerlach, (Nature. 334:585-591, 1988). Inanother 
5 alternative oligonucleotides designed to hybridise to the 5>-region of the dlg5 gene so as to 
form triple helix structures may be used to block or reduce transcription of the dlg 5 gene. In 
another alternative, RNA interference (RNAi) oligonucleotides or short (18-25bp) RNAi dlg5 
sequences cloned into plasmid vectors are designed to introduce double stranded RNA into 
mammalian cells to inhibit and/or result in the d egradation of dlg5 messenger RNA. Dlg5 

1U RNAi molecules may begin adenine/adenine (AA ) or at least (any base-A^C or G)A. . .. and 
may comprise of 18 or 19 or 20 or 21 or 22 or 23, or 24 or 25 base pair double stranded RNA 
molecules with the preferred length being 21 base pairs and be specific to individual dlg5 
sequences with 2 nucleotide 3' overhangs or hairpin forming 45-50mer RNA molecules. The 
design, construction and use of such small inhibitory RNA molecules is well known in the art 

15 and is more fully described in the following: Elbashir et al, (Nature. 411(6836):494-498, 
2001); Elbashir et al, (Genes & Dev. 15:188-200, 2001); Harborth, J. et al. (J. Cell Science 
114:4557-4565, 2001); Masters et al (Proc. Nad Acad. ScL USA 98:8012-8017, 2001); and, 
Tuschl et al, (Genes & Dev. 13:3 191-3 197, 1999). 

Pathway mapping may be used to determine each protein in the cell with which the 

20 DLG5 protein interacts and, in turn, the proteins with which each of these proteins interacts 
also. In this way it is possible to identify the specific critical signaling pathway which links 
the disease stimulus to the cell's response thereby enabling the identification of new potential 
targets for therapy intervention. 

According to a further aspect of the invention there is provided the use of the dlg5 gene 

25 or a fragment thereof in research to identify further gene targets imphcated in IBD. 

In another aspect of the invention, the single nucleotide polymorphisms of this invention 
may be used as genetic markers for this region in linkage studies. 
Further features of the invention include: 

- • A method_pf .treatment of a patient suffering from inflammatory bowel disease, 
30 c^rismg administ^ " 
or activity of dlg5 gene products. 

A method of treatment of a patient suffering from IBD, comprising administration to the 
patient an inhibitory nucleic acid molecule targeted against the mRNA of dlg5. 
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Use of m inhibitory nucleic acid molecule against dlgS or an antibody directed against 
DLG5 proteins, in the manufacture of a medicament for treating IBD. 

As used herein, the term "inhibitory nucleic acid molecule" refers to molecules selected 
from the group consisting of: antisense, ribozyme, triple helix aptemer and RNAi molecules. 
5 According to a further aspect of the invention there is provided a method of treating a 

human in need of treatment with a small molecule drug acting on the DLG5 protein or a drug 
comprising an inhibitory nucleic acid molecule acting against dlgS, in which the method 
comprises: 

i) detection of a polymorphism in the dlg5 gene in the human, which diagnosis preferably 
10 comprises determining the nucleotide present within human dlg5 gene that occurs at position 

16 in the nucleic acid that corresponds to any of SEQ ID Nos: 7, 9, 11, 13, 15, 17, 19, 21, 23, 
25, 27, 29, 31, 33, 35, 37fl39, 41, 43, 45, 47, 49, 51, 53, 55, 57, 59, 61, 63, 65, 67, 69, 71, 73, 
75, 77, 79, 81, 83, 85, 87|E89, 91 and 93; 

ii) determining the status"of the human by reference to polymorphism in the dlg5 gene; and, 
15 iii) administering an effective amount of the drug. 

According to aiurther aspect of the invention there is provided a method of treating a 
human in need of treatment with a small molecule drug acting on the DLGS protein or a drug 
comprising an inhibitory nucleic acid molecule acting against the dlg5 mRNA, in which the 
method comprises: 

20 i) measuring the level of the dlg5 mRNA in a tissue sample obtained from the human and, 

ii) determining the status of the human by reference to normal levels of the dlg5 mRNA; and, 

iii) administering an effective amount of the drug. 

According to a further aspect of the invention there is provided a method of treating a 
human in need of treatment with a small molecule drug acting on the DLG5 protein or a drug 
25 comprising an inhibitory nucleic acid molecule acting against the dlg5 tnRNA, in which the 
method comprises: 

i) measuring the level of the DLGS protein in a tissue sample obtained from the human and, 

ii) determining the status of the human by reference to normal levels of the DLGS protein; 
and, 

30 iii) administering an effective amount of the drug. 

According to a further aspect of the invention there is provided a method of treating a 
human in need of treatment with a small molecule drug acting on the DLGS protein or a drug 
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comprising an inhibitory nucleic acid molecule acting against the dlg 5 mRNA, in which the 
method comprises: 

i) detection of a polymorphism in the DLG5 protein in the human, which diagnosis preferably 
comprises determining the amino acid at any one of positions 140, 231, 624, 1067, 1089 or 

5 1481 of the DLG5 protein sequence shown in SEQ ID NO: 2; 

ii) determining the status of the human by reference to polymorphism in the DLG5 protein; 
and, 

iii) administering an effective amount of the drug. 

According to a further aspect of the invention there is provided a method of treating a 

10 human in need of treatment with a small molecule drug acting on the DLG5 protein or a drug 
comprising an inhibitory nucleic acid molecule acting against the dlg5 mRNA, in which the 
method comprises: 

i) detection of a polymorphism in the DLG5 protein in the human, which diagnosis preferably 
comprises determining the amino acid at any one of positions 30, 121, 514, 957, 979 and 1371 

15 of the DLG5 protein sequence shown in SEQ ID NO: 190; 

ii) determining the status of the human by reference to polymorphism in the DLG5 protein; 
and, 

iii) administering an effective amount of the drug. 

A method of treatment of a patient suffering from IBD, comprising administration to the 
20 patient of a compound capable of reducing the transcription or expression of dlg5. 

A method of treatment of a patient suffering from IBD, comprising administration to the 
patient an inhibitory nucleic acid molecule targeted against the mRNA of dlg5. 

Use of an inhibitory nucleic acid molecule or an antibody directed against dlg5, in the 
manufacture of a medicament for treating IBD. 

25 The invention will be further described by way of the following non-limiting examples and 
figures in which, 

Figure 1 - a - represents mRNA levels of DLG5 and the housekeeping protein b-actin; b - 
activation of the key apoptotic effector caspase-3 and cleavage of its substrate poly(ADP- 
ribose) polymerase-1 (PARP-l).in cells- treated with the specific siRNA directed against 
30 DLG5; c - Cells transfected withDLG5 siRNA showed a48% increase of apoptosis 

(determined by fragmented nuclei stained withDAPI) as compared to cells treated with the 
scrambled control siRNA; data from a representative experiment are shown. 
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Figure 2 - TaqMan analyses of DLG5 expression in DSS colitis model of C57B/6 mice. 
Relative expression is indicated as 2 AACr of DLG5. LI (Large intestine), SI (small intestine). 
Example 1 

Identification of dle5 as a gene associated with IBP. 
5 A genome-wide linkage scan involving 268 families (356 affected sibling pairs) of 

European descent was carried out to identify a susceptibility locus for IBD. Subsequently, a 
hierarchical linkage disequilibrium study was employed to -search for the causal variant(s) 
within a broad 40cM pericentromeric interval on chromosome 10, identified from an initial 
linkage scan. This endeavour was started with a fine mapping experiment involving 523 
10 affected sibling pairs and 16 micro satellite markers at an average distance of 2cM. The fine 
mapping experiment confirmed the initial linkage study, revealing a linkage peak at lOq, 
extending fromD10S201-D10S192, with a maximum MLS score 1.6 at D10S2470. Linkage 
mapping was followed by transmission disequilibrium testing (TdT) using the algorithm 
implemented in the GENEHUNTER software package (Daly MJ et aL, Am J Hum Genet, 
15 Suppl 63: A286, 1998) to further narrow the region containing the susceptibility gene. This 
analysis tests for association of a given marker with the disease phenotype in the presence of 
linkage, and represents the most powerful and robust test for association, while omitting the 
risk of false positives due to stratification bias. The TdT performed on a single trio (one of the 
affected sibpairs and its parents) from each of the families showed a significant single point 
20 association with the disease phenotype at D10S547 (p<0.001) and D10S201 (p<0.01). This 
led to the following strategy: the genomic region underlying the linkage peaks on 10pl4- 
10pl3 (extending from 8Mb-13Mb) and 10q22-10q23 (77Mb-82Mb) were genotyped with 
107 (SNPs) in 200 German IBD families (from the original linkage set) plus an additional 555 
German Trios (368 CD, 187 UC), (a trio being a DNA sample from a IBD affected child plus 
25 DNA samples from its mother and father respectively) and 548 German control individuals 
(non IBD affected individuals). Several SNPs showed significant association with CD and 
IBD, respectively, by TdT in single trios extracted from the families and the trios, further 
confirming the presence of linkage disequilibrium with the susceptibility locus over the 
investigated region. It was therefore decided to perform additional association studies with 
30 SNPs at an average density of 75-120kB to further narrow the region harbouring the 

susceptibility gene. Upon analysis of the high-density SNP panel, the lead region on lOq was 
scanned for nucleotide sequence variants as a highly significant single point association of the 
SNP marker TSC03764S4 located at 78.5Mb to IBD was found with a %2=11.5, p=0.00067. 
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TdT haplotype analyses further strengthened the association lead at TSC0376484 resulting in 
significant 2-marker and 3-marker haplotypes with the neighbouring SNP markers 
TSC0005010 and TSC0000361. A maximum ^2=15.44, p=0.00008 was observed for the 2- 
marker haplotype TSC0376484-TSC0000361 spanning a physical distance of approximately 
5 150kb. The positive SNP was located adjacent to the dlg 5 gene. To verify and further clarify 
the role of dlg5 in IBD, 6 gene-based markers for dlg5 were genotyped. The analysis of these 
markers also involving testing for haplotype blocks, as described by Daly et al. (Nature Genet. 
29:229-232, 2001), clearly showed that the association signal is entirely confined to the dlg5 
gene with a total of 17 markers with a positive association with IBD (Table 5), all of which 

10 are located on a common underlying haplotype. Identical genetic association studies were 
also carried out on an adjacent gene. These proved negative, demonstrating that dlg5 is the 
sole candidate for the susceptibility locus for IBD on 10q22. In depth re-sequencing for all 32 
exons and the exon-intron boundaries was performed in 47 individuals with proven diagnosis 
for IBD. Nucleotide sequencing was performed according to standard protocols, and the 

15 primers used are listed in Table 4. The DNA sequencing and analysis identified 20 novel 
nucleotide sequence variations (in addition to various publicly available SNPs) located in the 
digs gene, 3 of which lead to an amino acid change of the protein, and further a Tbp-deletion 
in the intron flanking.exon 13. The genotype-related risk (GRR) (Risch&Merikangas 
Science, 273(5281): 1516-7, 1996) is estimated to be 1.5 -2.5 based on the TDT results 

20 

Example 2 cloning and seguenge analysis nf 

A genomic DNA cnntip of 41 57 79 bp was const ructed hv a .c^ 1 v of rap 
AL 391421, Al 450306 and AT. 7 31<^ 

Using the database entry for dlg5 cDNA (BMBL accession number AF352034), all 
25 exons but the 5' UTR were mapped to the human genomic sequences covered by AL391421 
or AL4503306. The first 94 bases of AF352034 do not map to this region or any other region 
in the genomic contig around the gene. Since this sequence showed similarities to multiple 
regions within the human genome it was considered as an artifact derived from repetitive 
- - - sequence. A database BLAST search using sequence from exon 2 was used to identify two 
30 pig cDNAs containing novel 5' sequence (EMBL accession numbers BM 484383 and 
BI402246). This sequence was found to match the human genomic contig and was also 
flanked by a conserved splice site on the 3' end and is therefore considered as the true 5' exon 
of the human dlg5. Table 2 shows intron/exon border sequence information of the DLG5 
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gene. SEQ ID NO: 1 shows the sequence of dlg5 including the novel 5' sequence. SEQ ID 
NO: 2 shows the predicted amino acid sequence of dlg5. 

In depth re-sequencing for all 32 exons and the exon-intron boundaries was performed 
in 47 individuals with IBD. Sequences were aligned for SNP detection and in addition to 

5 detection of SNPs present in public databases, the analysis resulted in the identification of 20 
novel SNPs, four of which lead to an amino acid change of the protein, as well as two novel 
genomic short deletions, one a 7bp-deletion in the intron flanking exon 13, potentially 
influencing the splicing of the gene. The genotype-related risk (GBR) is estimated to be 1.5 - 
2.5 based on the TDT results. Table 3 lists the identified SNPs and adjacent sequence as well 

10 as the allelic versions of the SNPs. The Table also includes positions of typed SNPs from 
public databases. Table 4-:shows the primer sequences used to PCR amplify the SNP 
containing regions of dlg5. 



Table 2. 



EXON 


SEQUENCE 


size (bp) 


exon ui 


GAAGGCGCGGgtgag 


(> 166) 


exon 02 


cccagGTTCTACCTA 


. . . AGC AGTGTGGgtgag 


69 


exon 03 


cacagGCACTACCGG 


. . . .TTGACAAGAGgtagt 


163 


exon 04 


gccagGCCCTACCAC 


. . . . ACTTCTACCAgtgag 


144 


exon 05 


ttcagCACACTCCAC 


. . .GCAGCAGCAGgtagg 


184 


exon 06 


tccagGTGTTGAAGC 


. . . CCCTG AGG AGgtagg 


260 


exon 07 


gccagGl l i tiACiGCG 


. . . . GCTGCGGCAGgtagg 


313 


exon 08 


accagATCAAAGACA 


. . . . AC AGCATCCGgtatg 


185 


exon 09 


accagGACACTGTGT 


. . . . AGGAGCTCAAgtagg 


126 


exon 10 


cacagGGAACAGATG 


GAGGGAGACGgtaag 


133 


exon 1 1 


tgcagGAGGATATTG 


. . . .GCCGCTTAAGgtaag 


128 


exon 12 


ctcagGGTCAATGAC 


. . . . GG AC AG AAAGgtagc 


176 


exon 13 


tgcagACAGTGGCAT 


. . . . G ATCGTTGCGgtaag 


104 


exon 14 


tctagATCAATGGCA 


. . . CCTCCTGAAGgtaag 


93 


exon 15 


cctagGTATTCCCTC 


. GTTCCTGGAGgtata 


1020 


exon 16 


tgcagGAACAGAAGT 


. . . .CCGTCTGTGGgtgag 


124 . 


exon 17 


tctagGCACTGTTCC 


..TGCATCCCAGgtatg 


145 
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p-jmn 1 R 

vAUU X O 


c or n o r TCWC^C y A nr^ A f"* 


. . .uUCCCjCCTGGgtaac 


113 


Aynn 1 0 

wAUJI X .7 


nntfiofi' 1 ~ iy"" 1 ' 1 * 1 % r^n A n 


. -TCCGAGAGAGgtaag 


90 


pxnn 90 




. .(jrAAAGGACAGgtgag 


151 


pxnn 91 




. OTTACTGGAGgtgag 


163 


exon 22 


cccagTTCAACGGCA 


CCCGGTCCAG^taao 


1^4 


ex cm 9^ 


tr»r»5i crr^Tf a r"* a r^OTf* 1 


. .OCAGCTCCAGgtcag 


141 


exon 94. 


oncmrrO A TTHPrinn A 


. . .CATCCTGGAGgtgag 


184 


pyoti 9^1 


rrr*r»Qrr r P A r T , f^.(~if~* A (~*C^C^ 

ggcag 1A1 uuLAuLL . . . 


. . .TCTACATCAGgtacc 


149 


pxon-9fi ■ — - 




Trbr(~AAATATGTgtaag— 


-174 


pxrvn 97 


cacago/\ 1 UuALLAA . . 




197 


pxrvn 9R 




. .TGTCCCCTTGgtaag 


144 


exon 29 


tgcagAGGTGATGAA 


. . .CACAGAAAAGgtacc 


199 


exon 30 


cccagAACCGACACT 


. . . AGCACATCAAgtagg 


110 


exon 31 


cacagGGAGCAGAGA 


TACTTCACAGgtagg 


110 


exon 32 


tgcagGGGTCATCCA 


(>1749) 



Table 2 shows exon/intron borders for all exons of the dlg5 gene. The first and last 10 bp of 
each exon (capital letters) together with 5 bp of surrounding introns (lowercase letters) are 
indicated. Also, the total size of each exon is indicated. All sequences can be identified within 
5 human B AC clones with accession number AL39 142 1 and AL450306. 



Table 3 



FEATURE 


COMMENTS 


SEQUENCE 


SEQH) 


tsc0000361, 
allele 1 


SNP, G to A, 
intron 


gctcggtggcagcgaGtgagaggagctcagt 


6 


tsc0000361, 
allele 2 




gctcggtggcagcgaAtgagaggagctcagt 


7 


DLGe2, allele 1 


-SNP.CtoT, 
5' of exon 2 


tggtctcccctctttCcccaggttctaccta 


8 


DLGe2, allele 2 




tggtctcccctctttTcccaggttctaccta 


9 
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allele 1 


SNP, C to T, 
3' of exon 2 


gtgggtgagtaccacCgtctggggaggacac 


10 


allele 2 


gtgggtgagtaccacTgtctggggaggacac 


11 


allele 1 


SNP, A to G, 
within exon 3, 
non-synonymous 


ccctcctcactgaccAgcaagtgaatgagaa 


12 


allele 2 


ccctcctcactgaccGgcaagtgaatgagaa 


13 


uiAjeo, allele 1 


SNP, G to C, 
within exon 3, 
synonymous 


ccgcaagcgcctggcCtttgctacgcatggc 


14 


DLGe3, aUele 2 


ccgcaagcgcctggcGtttgctacgcatggc 


15 


rsiZH-ooyj, allele l 


SNP, CtoT, 
3 of exon 3 


ctgagtgtcccctttCccccacctcatgtcc 


16 


rsl248695, allele 2 


ctgagtgtcccctttTccccacctcatgtcc 


17 


DLGe5A,allele 1 


SNP, A to G, 
within exon 5, 
non-synonymous 


ttcagcacactccacAgccggctcctgagtg 


18 


uiAjeD a, allele 2 


ttcagcacactccacGgccggctcctgagtg 


19 


aUele 1: 


SNP, C to A, 
3' of exon 5 


cccagcccctggagaCtggccatttctccca 


20 


jL/iAje3o, 
allele 2: 


cccagcccctggagaAtggccatttctccca 


21 


rc19AR£Rn "~ 
rbi^fooou, 

allele 1 


SNP, A to C, 
intron 


ggcagccacccacctActcagatccagccta 


22 


rQl94R£Rft 

allele 2 


ggcagccacccacctCctcagatccagccta 


23 


Tit 

allele 1 


SNP, G to A, 
within exon 7, 
synonymous 


gagaaccacgcaggtGaagacagcaaaggag 


24 


DLGe7, 
allele 2 


gagaautacgcaggcAaagacagcaaaggag 


25 


rsl270912, ' 
allele 1 


SNP,TtoC, 


ctcagctgtggtggaTagactggaeagtgcc 


26 
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intron 


ctcagctgtggtggaCagactggacagtgcc 


27 


DT frpIO 
xJ X-/VJTC X u, 

allele 1 


oJNF, G to C, 
within exon 10, 
non-synonymous 


gaagttgtagagttcGagagggagacggtaa 


28 


DLGel 0 
allele 2 


gaagttgtagagttcCagagggagacggtaa 


29 


DT frf»13 A 

aUele 1 


oiNP, C to T, 
within exon 13, 
synonymous 


ggagtgtatgctgcCgctgtgctgcctgga 


30 


x-U-»vjre 1 


ggagtgtatgctgcTgctgtgctgcctgga 


31 


allele 2 








X-U_AJCXDX5, 

aUelel 


deletion 

► GCTGGAG 3' of 
. exon 13 


gcggtaagtctcaagGCTGGAGccagggt 
catctgcc 


32 


dt npi ^ 
allele 2 


gcggtaagtctcaagccagggtcatctgcc 


33 


DT n» 1 A A •» 

xJXAjei^A, i 
allele 1 


SNP, G to A, 
5 f of exon 14 


ggtaggcctgaggccGctctgcctgtggcct 


34 


DT OpIAA 

allele 2 


ggtaggcctgaggccActctgcctgtggcct 


35 


allele 1 Z 


oJNx% U tO Vjr, 

within exon 14, 
synonymous 


tgaatctctgctgcgCagctgccaggactcc 


36 


J.oX<£ < +OU^2J, 

allele 2 


tgaatctctgctgcgGagctgccaggactcc 


37 


allele 1 


oJNF, G to A, 
3' of exon 14 


catgctactccttggGgtcacaggatccttg 


38 


DT frf»1ZL"R 

allele 2 


catgctactccttggAgtcacaggatccttg 


39 


allele 1 


SNP, C to T, 
within exon 15, 
synonymous 


cggggagcccatgcaCgcatcaccccctcgc 


40 


DLG8T5A; ' 
allele 2 






DLGel 5B, 
allele 1 


SNP, G to A, 
within exon 15, 


acgcatcaccccctcGcaaggccagggtccg 


42 
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DLGel5B, 
allele 2 


non-synonymous 


acgcatcaccccctcAcaaggccagggtccg 


43 




DLGelSC, 
allele 1 


SNP, C to T, 
within exon 15, 


actcctcccacctgcCggccaagaaatcctg 


44 




DLGel5C, 
allele 2 


non-synonymous 


actcctcccacctgcTggccaagaaatcctg 


45 




rs2289308, 
allele 1 


SNP, C to T, 
5' of exon 17 


ctgggtcctttggggCgtcttttctcaccaa 


46 




rs2289308, 
allele 2 




ctgggtcctttggggTgtcttttctcaccaa 


47 




rsl248634, 
allele 1 


SNP, C to T, 
within exon 17, 


aagcccatttctaggCactgttccccggagt 


48 


ocsea 


rsl248634, 
allele 2 


synonymous 


aagcccatttctaggTactgttccccggagt 


49 




rsl248635, 
allele 2 


SNP, C to T, 
3' of exon 17 


tacgcttctctgtacCccagctgcccaagcc 


50 




rsl248635, 
allele 4 




tacgcttctctgtacTccagctgcccaagcc 


51 




DLGel8, 
allele 1 


SNP, G to C, 
3' of exon 18 


tcctgggactgagctGatttctctactggga 


52 




DLGel8, 
allele 2 




tcctgggactgagctCatttctctactggga 


53 




DLGel9, 
allele 1 


SNP, C to T, 
within exon 19, 


gagtgtcgtgggctcCgagagaggtaaggac 


54 




DLGel9, 
allele 2 


synonymous 


gagtgtcgtgggctcTgagagaggtaaggac 


55 




rsl248625, 
allele 1 


SNP, G to C, 
3' of exon 21 


tcctagggaacagcaGtgctcccaagtcccc 


56 




rsl248625, 
allele 2 




tcctagggaacagcaCtgctcccaagtcccc 


57 




DLGe23, 
allele 1 


SNP, C to T, 
within exon 23, 


acacctggaccctgcCggtacccactccact 


58 
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DLGe23, 
allele 2 


synonymous 


a^a^ciggaccctgc i ggtacccactccact 


59 


rs2289310, 
allele 1 


SNP, C to A 
within exon 23, 


ggcctagcaccccccuagccaagcagagcag 


~60 


rs2289310, 
allele 2 


non- synonymous 


ggcciagcaccccccAagccaagcagagcag 


61 


rsl261990, 
allele 1 


SNP, T to A, 
5 1 of exon 25 


aggagggatgttgaaTttctgccgtatggtc 


62 


rsl261990, 
" allele 2 




a gg a gggatgttgaaAttctgccgtatggtc 


63 


DLGe25A, 
allele 1 


SNP C to G 
5 1 of exon 25 


gccgtatggtcagcaCtggcccctctcgggt 


64 


DLGe25A, 
allele 2 




gccgtatggtcagcaCrtggcccctctcgggt 


65 


DLGe25B, * 
allele 1 


SNP G to A 
5 ! of exon 25 


gcactggcccctctcGggtgcccaagctgcc 


66 


DLGe25B, 
allele 2 




gcactggcccctctcAggtgcccaagctgcc 


67 


rsl058198, 

allele 1 i 


SNP, C to T 
: within exon 26, 


gagcataagaaggaugacatcctctacgtg 


68 


rsl058198, 
allele 2 


synonymous 


gagctnaagaagga l gacatcctctacgtg 


69 


DLGe26, 
allele 1 


deletion of A, 


ggggtggggtggggcAggggtcgccgagggc 


70 


DLGe26, 
allele 2 


3' of exon 26 


ggggiggggtggggcggggtcgccgagggc 


71 


rs2289311, ■ 
allele 1 


SNP, T to C, 


agggcagcagggtctTgatggccctgcccag 


72 


ts2289311y 
allele 2 


5 ! of exon 27* 


agggcagcagggtctCgatggccctgcccag 


73 


DLGe27, " 
allele 1 


SNP, C toT, 
within exon 27, 


agatgacaatagcgcCacaaagacgctgtca 


74 
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DLGe27, 
allele 2 


synonymous 


agatgacaatagcgcTacaaagacgctgtca 


75 


rs2241831, 
allele 1 


SNP, C to A, 
5' of exon 29 


ggttactgacagctgCtgagcagtgttcttc 


76 


rs2241831, 
allele 2 


ggttactgacagctgAtgagcagtgttcttc 


77 


rs2241833, 
allele 1 


SNP, C to T, 
intron 


cctggatgcctgggaCgacagacatgacaga 


78 


rs2241833, 
allele 2 


cctggatgcctgggaTgacagacatgacaga 


79 


fs2579150, 
allele 1 


S5NP, C toT, 
&3!,of.exon31 


ggctgttttcttagcCgtggagaagcccgcg 


80 


rs2579150, 
allele 2 


ggctgttttcttagcTgtggagaagcccgcg 


81 


rsl058202, 

allele 1 s 


'SNP, GtoA, 
,3'.UTR 


gccgcctgaggggacGccagactcagctctt 


82 


rsl058202, 
allele 2 


gccgcctgaggggacAccagactcagctctt 


83 


rsl058203, 
allele 1 


-SNP, C to T, 
3'UTR 


aagtagaagtctgtcCgtctatgaacatgcg 


84 


rsl058203, 
allele 2 


aagtagaagtctgtcTgtctatgaacatgcg 


85 


rs2165046, 
allele 1 


SNP, A to G, 
3'UTR 


tgtctatgaacatgcAggggaaggatccgga 


86 


rs2 165046, 
allele 2 


tgtctatgaacatgcGggggaaggatccgga 


87 


rs2165047, 
allele 1 


SNP, G to A 
3'UTR 


ctctcctggaaggacGtcacaactccaggtg 


88 


rs2165047, 
allele 2 


ctctcctggaaggacAtcacaactccaggtg 


89 


rs2579.151, 
allele 1 


SNP.CtoG, 


tctccagaagcttcaCtcacactccactggt 


90 
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rs257915L 
allele 2 


3' of gene 


tctccagaagcttcaGtcacactccactggt 


91 


tsc0376484, 
allele 1 


SNP.Gto A, 
3' of gene 


agaettaeacttcttGaacaacctntflnoiT 


oo 
yZ 


tsc0376484, 
allele 2 


agagttagacttcttGaacaaccttttaagg 


93 




Table 3 identifies the SNPs and adjacent sequence as well as the particular allelic version. The 
t able includes novel SNPs identified through mutation detection as well as SNPs from pub lic 

databases. All the public domain SNPs used have an rs- or tsc-number which will identify 

5 them uniquely in the genome. 



Table 4 



Name 


Sequence 


oEQ ID NO: 


Dlg5exlF 


5-CCATGACGGAGGTGGAAGC 


oo i 
yi 


Dlg5exlR 


5-AGAGGAGCGAGTCCACCGA 


98 


Dlg5ex2F 


S'-GACTGATGATCAGCTGGCTTG 


99 


Dlg5ex2R 


5-CGGAAGGATGATCCTGTGAG 


96 


Dlg5ex3F 


5'-CCAGGGGAGGATGCAA 


97 


Dlg5ex3R 


5-ACAAGCACACCACTATCAGGG 


98 


Dlg5ex4F 


5'-CCTAATCCAGGACCTGGTTC 


99 


Dlg5ex4R 


5'-CTTGCACAGGGACAGGACTAG 


100 


Dlg5ex5F 


5'-GCTGTATCTACGGGAAGTGTTG 


101 


Dlg5ex5R 


5-GATCACAGATGTGAGCCAACG 


102 


Dlg5ex6F 


S'-CCTTGTCATCAGTCTCACCCTC 


103 


Dlg5ex6R 


5'-GAGCCACGATTCCCAAGACA 


104 


Dlg5ex7F 


5'-ACATCTCGCCACCTCTCTTG 


105 


Dlg5ex7R 


5'-TGCTGTAGGAGAGGCTGAAA 


106 


Dlg5ex8F 


S'-TCAGCAACCTCTCCCTCTTC 


107 


Dlg5ex8R 


5-ACGCCAGCTTGAGGTCAC 


108 


Dlg5ex9F 


5'-TGACCTTGTCCTCTGCTCCT 


109 


Dlg5ex9R 


5'-CATTGCCTTGCCCAGAAG 


110 
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Dlg5exlOF 
Dlg5exlOR 



Dlg5exllF 
Dlg5exllR 
Dlg5exl2F 
Dlg5exl2R 



Dlg5exl3F 
Dlg5exl3R 



Dlg5exl4F 
Dlg5exl4R 



Dlg5exl5aF 



Dlg5exl5bF 
Dlg5exl5cF 
Dlg5exl5aR 



Dlg5exl5bR 
Dlg5exl5cR 



51 - 



S'-CCGTGGcrCTCTCTGTTCAC 

s'-tcaccgtctcctccttcatc" 



5'-TTTGTGAAATGGTTGCTGT 
5'-GTGCTACCTGGCTCTCTTCG 
5'-CCCCTGAGTGAGAGTTGTGG 
5'-CCACTGAGGTTGATGTGCAG 



5'-C ACCAGG GTAGATGTAACTG AG 
5'-GATlClTATTTCCCTCCCAGAC 
5'-CTCCTGACTTTGGCACCrTG" 



S'-TTCAGACCAGCGTCCAGTCA 



S'-TGCTTTACCTCTGGGGATGG" 



5-CTTCCGCTCAGATGCCTCTG 



s'-cagaaggagcgactccattaag" 

5--GCAAAGGCACCAGGCTAAAC 



5-GGTAGTAGCTGGAAGCAATGC 



111 
7l2~ 
ITJ 
TIT 
Tl5" 
Tl6" 



117 
TIF 



119 
120 



121 
12T 



123 
124 



125 



Dlg5exl6F 



Dlg5exl6R 



Dlg5exl7F 



Dlg5exl7R 



Dlg5exl8F 



Dlg5exl8R 
Dlg5exl9F 



5'-CAGAGAGCTTCTCAGGCACTG 
S'-TGGCCACACTCCACTCnTC 



5-CTCAGGGCTGAAAACACATG 



5-GAGCCACAGCCACATTGTGA 
5'-GGAAGClTCTCCACCAATGA 



S'-GAACCCTTGCCTGGTCTGTG 



S'-GAAAGCAATGGCTCTGACAG 



126 
127 



128 



129 



130 



131 



132 



S'-GACTGGTAGCCTGGTGGAGA 



133 



Dlg5exl9R 
Dlg5ex20F 
Dlg5ex20R 



5'-GAAGTTCTCAGCTAAGCCCAG 



5-GCAATGCAGAGCCTAGCATC 



134 



135 



Dlg5ex21F 



Dlg5ex21R 



5'-TGCTGGGACACTCAAGCTAC 



5'-CTGCACrGTCAGATCATATGC 



5-ACACCAGGATGGGCTCAGTG 



136 



137 
"138" 



Dlg5ex22F 



Dlg5ex22R 



Dlg5ex23F 
Dlg5ex23R 



5-GAACAGCAGTGCTCCCAAGT 
5'-CCAGAACTTACGGCTGGCAC" 



S'-UCTCAGATCTAGTTGCCACAGG" 



139 



140 



141 



5'-CCAcrrGGAGAATGTGCTCAG~ 



142 
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lj ig j ex^Hr' 


D -GGAAGAAGCAGGCAGAAAGC 


143 




j - 1 Li 1 AG 1 CCTCCGTCJL 1 1 GGTG | 


144 


x-j ig j CAZ.J r 


D -uAt 1 GAG I CCTTCCTGCAGAG 


145 


TVIa^PY9</R 

LJlgO CA^J XV 


C' A C^C* A O O A A A A A /^T/Trp/^O A 

D -GAGGAGGAAAAGAGTCTCCAG 


146 


x^igjeA^ur 1 


-> 1 CGUCG aitcctgatc aag 


147 


Dlg5ex26R 


5'-GAAGCAGAATCCCTCCTCCAG 


148 


l/lgJCAZ / IT 


D -Vj 1 AbLL 1 1 GAGACCTGCCA AG 


149 




ci TP TP P rvrrvTO a a r~* a rrnnn a /-^ 

j-lul GGCTCTGAAGATGGC AG 


150 


LJ IgD cxzor 


Ci PTPPTP A t*/^ orpo /~i a rvp/in A /""l 

D -G 1 GG 1 GATGCTGGACTCCAG 


151 


LJ IgD CaZ01\ 


:> -LAUbL 1 1 G 1 GGAAC ACTGTG 


152 




D - <jr 1 GAGA 1 1 G A 1 GC ATGGC AG 


153 


TYI rr C 0 vO Q "D 


5 -C AGGC ACAGGTGAACTC AGAC 


154 




3 -LlUlblbbLl I rACIGCCTTG 


155 


J-/IgD cXj UK. 


:> -GGA1 AGGCCCATCTCTCATTC 


156 


Dle5ex31F 


vjv-* x vj a a vJ X Vj X vJv/ XXX £\ L VJ X vJT 


1 


Dlg5ex31R 


5'-AGAATCCTGACGTTGGCCAG 


158 


Dlg5ex32F 


5-CTGGTGAAGGAGAGTCAGGTG 


159 


Dlg5ex32R 


5'-GTGCTTCTGGGTCCTGGTTC 


160 



Table 4. Primers used for mutation detection of the dlg5 gene. Each primer is named after 
exon number and either F for forward or R for reverse primer. 



5 Tables 



SNP 


IBD families and trios 






IBD 

trios 








T:U 


T/U 


chi2 


p- value 


T:U 


T/U 


chi 2 


p- value 


TSC0376484 


272:198 


1.37 


11.65 


p=0.0006 


206:150 


1.37 


8.81 


p=0.0029 


rs2579151 


275:225 


1.20 


5.00 


p=0.02 


212:170 


1.24 


4.62 


p=0.03 


rs2165047 


283:232 


1.21 


5.05 


p=0.02 


214:168 


1.27 


5.54 


p=0.018 


rs2165046 


369:296 


1.25 


8.01 


p=0.004 


277:212 


1.30 


8.64 


p=0.003 


rsl058203 


286:244 


1.17 


as. 


n.s. 


221:181 


1.22 


3.98 


p=0.04 


rs2579150 










220:178 


1.23 J 


4.43 


p=0.03 
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rs2241833 


287:237 


1.21 


4.77 


p=0.02 


219:172 


1.27 


5.65 


p=0.017 


rs2289311 


360:287 


1.25 


9.22 


p=0.002 


267:204 


1.30 


8.43 


p=0.003 


rsl058198 


348:270 


1.28 


9.84 


p=0.0017 


262:193 


1.35 


10.46 


p=0.001 


rsl261990 


284:236 


1.20 


4.43 


p=0.03 


221:174 


1.28 


5.59 


p=0.01 


DLGel8 


359:280 


1.28 


9.77 


p=0.002 


272:206 


1.32 


9.11 


p=0.003 


rs2289308 


349:276 


1.26 


8.53 


p=0.003 


265:205 


1.29 


7.66 


p=0.005 


rsl248634 


287:235 


1.22 


5.18 


p=0,02 


220:175 


1.25 


5.13 


p=0.02 




Zo4:z44 


1.16 


n.s. 


n.s. 


217:177^ 


1.22 


4.06 


p=0.04 


rsl248696 


134:105 


1.27. 


3.52 


p=0.06 


107:84 


1.27 


2.9 


p=0.09 


DLGe2 


128:44 


2<90 


41.02 


p=0.000000 


102:32 


3.0 


34.00 


p=0.000000 


TSC0068513 


327:281 


Jfcl6 


n.s. 


n.s. 


257:203 


1.26 


6.34 


p=0.011 



Table 5. Data demonstrating the association between IBD and dlg5. 



Example 3 

5 Expression analysis Of DLG5 o n a panel of normal tissues and colon biopsies from patients 
with IBD using real timfe PCR 

Real-time PCRiexperiments where performed on Applied Biosystems 7900 HT in 384 
format with Syber Green chemistry (double-stranded DNA binding dye, minor groove 
binding) and fluorescent probes. In order to be able to detect any unspecific amplification and 

10 melting, curve analysis was performed after each completed PCR. 

Each sample was run in duplicate with 4ng of template in each reaction (lOjutl). The 
PCR reactions were run at 50 cycles for both sample and reverse transcriptase negative 
controls. Non-template controls where also studied to confirm that the signals were not due to 
the primers themselves. The primers where designed fromcDNA sequences in the program 

15 Primer Express, manufactured by Applied Biosystems. The primers were all complementary 
to the intron exon junction. As internal standard h36b4 (acidic ribosomal phosphoprotein P0) 
and |3-actin were used to normalize for differences in RNA input and cDNA synthesis. 
The relative expression in the different samples was calculated by the parameter Gr <threshold 
cycle). C T is defined as the fractional cycle number at which the fluorescence passes a 

20 threshold above baseline. 

The relative expression of the dlg5 gene is calculated with the formula: 2' AC T where AC T 
is defined as: (C T DLG5 -C T 36B4) alternatively (C T DLG5-C T p-actin) 
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C T DLG5 = the threshold cycle for the dlg5 gene in the tissue of interest. 
C T 36B4 = the threshold cycle for 36b4 in the same tissue. 
C T P-actin= the threshold cycle for P-actin in the same tissue 

The primers used for dlg5 (exon27-exon28) oriented in 5' to 3' direction were forward- 
5 CCAGTGACTCCATTCCACTCTTT (SEQ ID NO: 161) and reverse: 

CGGTGCAGTCCACCTTCTGCSEQ ID NO: 162). Also primers were used for exon 29-30 
forward: GGAGAAGCGGCCATTTCG (SEQ ID NO: 163) and reverse- 
CTCAATAGCGTGCGGAGCAACSEQ ID NO: 164), exon 32 forward- 
CAGTGTGGGTGTCTTCGTTTGG (SEQ ID NO: 165) and reverse- 

Al'l AACAGGACGCATAGCTTAAGGA ( SEQ ID NO: 166) on a subset of the material For 
pnmers against exon 29-30 and exon 32, fluorescent probes were used for detection using 
sequences: exon 29-30: AAAGGAGATCACAGAAAAGAACCGACACTGC (SEQID NO- 

167) and exon 32: TGAGGCTAGATATGTCTGGCTGAAGATTTGATGTG (SEQ ID NO- 

168) . Detection of amplified fragments for exon 27-28 was done using Syber green 
15 Chemistry. 

For endogenous control genes such as acidic ribosomal phosphoprotein P0(h36b4) and 
P-actin, primer sequences were, forward: CCATTCTATCATCAACGGGTACAA (SEQ ID 
NO: 169) and reverse: AGCAAGTGGGAAGGTGTAATCC (SEQ ID NO: 170) for h36b4 
and forward: AGCCTCGCCTTTGCCGA (SEQ ID NO: 171) and reverse- 
20 CTGGTGCCTGGGGCG (SEQ ID NO: 172) for P-actin For p-actin, detection with a 

fluorescent probe was used with the sequence CCGCCGCCCGTCCACACCCGCC (SEQ ID 
NO: 173), while Syber Green Chemistry was used for detection of h36b4. 
Tissue panel 

Complementary DNA (cDNA) was purchased from Clontech Laboratories Inc. Twelve 
25 dnTerent tissues was used: heart, placenta, liver, skeletal muscle, kidney, pancreas as part of 
human MTCTTM I #K1420-1. Testis, prostate, small intestine as part of human MTCTTM H 
K#1421-l. Ileum cDNA is a part of human foetal MTCTTM fl K#1425-l. Descending 
Colon is part of human Immune MTCTTM H K#1425-l. Brain mRNA (Human brain whole 
#6516-1) from whichcDNA.was prepared_using Superscript™. First-Strand Synthesis System - 
30 for RT-PCR (Gibco BRL), was used for the cDNA synthesis. Five hundred nanogram of total 
RNA was used for each reaction using the oligo dT primer provided in the kit for the reverse 
transcriptase (RT) reaction and RT negative controls. 
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Colon Biopsies 

Colon biopsies were taken from patients using endoscope and a grasp biopsy tool 
Biopsies were taken from the mucosa at different parts of the colon and terminal ileum. 
Results 

Tissue distribution of DLG confirmed high expression in placenta and testis and also 
high expression in brain, prostate, descending colon and ileum as shown in Table 6. 
Analysis of colon biopsies showed a markedly decreased expression in Crohn's (CD) patients 
compared to non-IBD disease controls. Almost 6-fold difference was seen in one subset of the 
samples consisting of 6 non-IBD controls and 29 CD cases shown in Table 7. 
10 A more modest decrease was seen in an another subset consisting of 16 hospitalised 

nonnals(HN)(approx.l.5^fold) and 39 non-IBD controls (DC)(approx. 2-fold) compared to 23 
healthy controls as showniin Table 8. Whereas a consistent difference was seen in a third and 
a fourth subset of 40 ulcerative colitis (UC) and 49 CD sarrples compared to 25 healthy 
normals. The results iarecshown in Table 9 and 10, suggesting a 1.5- and 2.3-fold decrease 
15 respectively. \ 
In summary: • . 

We have foundJDLG5 to be expressed in most tissues examined: placenta, heart, 
prostate, skeletal muscle,-liver, pancreas, kidney, brain, colon, testis, ileum and small 
intestine. The highest amount of expression was detected in placenta and ileum We have also 
20 identified a significant difference of expression between colon biopsies from IBD patients 
relative to non-IBD controls and healthy controls. 



Tissue 


2 -UUt +JQQQ 


Prostate 


7.21 


Placenta 


31.1 


Heart 


1.7 


Skeletal muscle 


1.71 


Liver 


0.42 


Pancreas 


1.44 | 


Kidney 


2.39 


Brain 


19.86 


• Descending Colon 


5.28 


Testis 


6.91 


Ileum 


7.9 


Small intestine 


2.26 
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Table 6. Relative expression of DLG5 in selected human tissues. 



Exon 27-28 


DC(n=6) 


CD(n=29) 


Mean 


23.88 


4.08 


Median 


24.22 


4.043 


Pvalue 


2.82406E-06 





Table 7. Relative expression analysis using real time PCR according to formula 2 ACt *1000 
using h36b4 as endogenous control. Colon biopsies were taken fromnon IBD disease controls 
5 (n-6) and Crohn's disease patients (n=29). P-values are calculated using Student's T-test 
assuming non-equal variance. 



Exon 29-30 


Normal (n=23) 


HN (n=16} 


DC (n=39) 


Median 
P value 


2.778 
2.800 


1.939 
1.980 
4.364E-04 


1.476 
1.300 
5.303E-O9 








1 Exon 32 


Normal (n=251 


HN (n=16) 


DC (n=39) 




Mean 
Median 
Pvalue 


0.954 
0.866 


0.630 
0.485 
3.24E-02 


0.420 
0.251 
1 .63E-05 





Table 8. Relative expression analysis using real time PCR according to formula 2"* a *1000 
10 for colon biopsies taken from healthy normals, HN = hospitalised normals (n=16), DC = Non- 
IBD disease controls (n=40). P-values are calculated using Student's T-test assuming non- 
equal variance. Primers against both exon 29-30 and exon 32 were used showing reproducible 
results. 



Exon 29-30 


Normal (n=25) 


UC (n=40) 


Mean 
Median 
P value 


2.382 
2.408 
4.31 E-07 


1.444 
1.355 






Exon 32 


Normal (n=23) 


UC (n=39) 


Mean 
Median 
-Pvalue 


1.00 
0.88 
-- 1.-10E-06 


0.45 
0.43 



15 Table 9. Relative expression analysis using real time PCR according to formula 2"^ *1{)00 
for colon biopsies taken fromhealthy normals and Ulcerative colitis (UC) patients. P-values 
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are calculated using Student's T-test assuming non-equal variance. Primers against both exon 
29-30 and exon 32 were used showing reproducible results. 



Exon 32 


Normal (n=25) 


CD (n=49) 


Mean 
Median 
P value 


0.927 
0.881 
1.003E-04 


0.570 
0.517 




Exon 29-30 


Normal (n=24) 


CD<n=49) 


Mean 
Median 
P value 


2.508 
2.170 
2.07E-04 


1.568 
1.480 



5 Table 10. Relative expression analysis using real time PCR according to formula 2 *1000 
for colon biopsies taken from health normals and Crohn's disease (CD) patients. P-vahies are 
calculated using Student's T-test assuming non-equal variance. Primers against both exon 29- 
.-jw 30 and exon 32 were used showing reproducible results. 

: 10 Example 4 

siRNA inhi bition of dlg5 expression induces apoptosis in HeLa cells 

In order to simulate reduced levels or loss of function of dlg5 in a standardized in vitro 
model of epithelial cells, dlg5 mRNA was knocked down in Hela cells by small interfering 
RNAs (siRNAs). 

15 For transfection experiments, Hela cells (ATCC) (<passage 20) were plated in 6- well 

plates at 2-4 X 10 s cells/well, and 24hrs later transfected with custom-made dlg5 siRNAs or 
scrambled control siRNA using the TransMessenger transfection kit (all from Qiagen, Hilden, 
Germany). Cells transfected with the siRNA directed against dlg5 and/or a vector encoding 
enhanced green fluorescent protein (pEGFP) were analysed 48 hours after transfections. In all 

20 experiments, a scrambled, nonspecific siRNA (control siRNA) as well as the transfection 
reagent TransMessenger (TransM.) alone were used as internal controls. Optimal knockdown 
of dlg5 transcripts was achieved using 2 X 10 5 cells/well and 4 /xg/well of the siRNA 
S'-GAAGGATGACGTGGACATGCT-S' (positioned at base 389 of the coding sequence of 
DLG5 - SEQ ID NO: 174), 8 /xl/well of Enhancer R and 40 /xl/well of TransMessenger 

25 reagent. For visualization of transfection efficiency, cells were co-transfected with 2 fig 
pEGFP-Cl (BD Clontech, Palo Alto, CA), seeded on coverslips, fixed for DAPI staining and 
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detection of fluorescence using an Axiophot microscope (Zeiss, Germany), and pictures were 
captured by a digital camera system (Axiocam^eiss). 

Expression analyses For estimation of level of knockdown of the dlgS transcript, mRNA 
levels of digs and fi-actin were analysed by RT-PCR. The following primer pairs were used- 
5 (3-actin: 5 '-GATGGTGGGC ATGGGTC AG-3 ' (SEQ ID NO: 175) and 
5'-CTTAATGTCACGCACGATTTCC-3' (SEQ ID NO: 176), 
dlg5: 5'-AAACTGTATGACACGGCCATGG-3' (SEQ ID NO: 177) and 
5'-CTCCTCCCTGTATTTCTCCGACTC-3* (SEQ ID NO: 178). 

— - additi ° D> eXpreSSi ° n ° f ^ ^ible gene, OAS 1 was mea sured in order 

10 * exclud * falling artefacts which might be induced by siRNA. Primers used for detection 
ofOASl were5'-ACCATGCCATTGACATCATCTG-3'(SEQIDNO: 179) and 
5'-AAGACAACCAGGTGAGCGTCAG-3' (SEQ ID NO: 180). 
Western blot analyses 

In otder to measure apoptosis in DLG5 depleted cells, western blotting analysis of 

15 caspase-3 and PARP-1 cleavage was performed. Cell extracts (standardized to 10 ng of total 
protem/lane) were separated by 12 or 15% denaturing SDS-PAGE and transferred to a 
polyvinylidene difluoride membrane (Hybond-P; 0.8 mA/cm 2 for 60 min; Amersham 
Pharmacia Biotech) by. semidry blotting using an electrobtotter (Bio-Rad). Antibodies were 
dduted in blocking buffer. Membranes wet* subsequently washe4, incubated with ECL-Hus 

20 Detection Reagent, .and exposed to Hyperfilm BCL (both from Amersham Pharmacia 
Btotech). Primary antibodies directed against caspase-3 and PARP-1 were obtained from Cell 
Stgnahng Technology (Beverly, MA). Protein contents we re normalized by subsequent 
hybndization with an antibody against fi-actin (Sigma-Aldrich, St. Louis, MO). Between me 
stamings with specific Abs, blots were shipped in 2% SDS, 62.5 mM Tris, and !00 mM 2-ME 

25 for 30 min at 50°C, washed, and blocked again. The bands were quantified using «he 
densttometry program SigmaGel (Jandel Scientific, Sau Rafael, CA). AH Western blots were 
exposed to film for varying lengths of time, and only films generating suhsaturating levels of 
mtensity were selected for dcnsitometrical and statistical evaluation Linearity was assured in 
mdependent experiments by using different amounte of materia, and-multiple film-exposures 

30 (date not shown). Bach Western blotting experiment was conducted with two separate 
membranes in parallel to detect potential stripping artifacts. 
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Results 

siRNA inhibition of dlg5 expression strongly induced apoptosis of Hela cells without 
any further pro-apoptotic stimuli. Apoptosis was determined by immunoblots detecting 
activation of the key apoptotic effector caspase-3 and cleavage of its substrate poly(ADP- 

5 ribose) polymerase- 1 (PARP-1). Moreover, co-transfection of a vector encoding enhanced 
green fluorescent protein (pEGFP) revealed that after 48 hours, up to 50% of the siRNA- 
transfected Hela cells showed fragmented nuclei after DAPI st aining , which is a hallmark of 
apoptosis (Fig.l). The effect was not due to an interferon response (Bridge, A. J. et ah Nat 
Genet 34,263-264 (2003), since transcript levels of 2'5'-oligoadenylate synthetase (OAS1) 

10 were not influenced by dlg5 siRNA (data not shown). Initial time kinetics revealed that DLG5 
mRNA levels were significantly reduced 24 hours after siRNA transfection, but even stronger 
after 48 hours. The extent of apoptosis (determined by cleavage of caspase-3 and PARP-1) 
closely matched dlg5 mRNA levels. Furthermore, staining of mucosal sections from 
homozygous CD patients.carrying the disease associated variant of rs2289310, 

15 ggcctagcaccccccAagccaagcagagcag (SEQ ID NO: 181) (n=3) for cleaved caspase-3 as a 
marker for apoptosis showed increased apoptotic staining compared to that for patients 
homozygous for ih&mldtype allele, ggcctagcaccccccCagccaagcagagcag (SEQ ID NO: 182) 
(n=3). 

In summary 

20 These novel findings suggest that functional DLG5 is an essential factor for epithelial 

cell survival 

Example 5 

Expression analyses of DLG5 on colon samples from mice treated withDextran Sulphate 
25 Sodium (DSS\ 

The DSS mouse colitis model was used to investigate the expression pattern of DLG5 
during acute colitis as well as during the recovery phase of DSS induced colitis. 
Mice 

7-12 weeks old C57BL/6 mice were exposed to DSS (Dextran sulphate sodium; TDB 
30 Consultancy AB, Sweden) added to the drinking water at a concentration of 3% for 6 days. 
Animals in the recovery phase received DSS for 5 days followed by a water period of 2 weeks 
j(d5+14). C57BL/6 mice not exposed to DSS were used as control animals. For each group 
five individual animals were analysed. After 7 days (acute phase) or after d5+ 14 (recovery 
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phase) animals were sacrificed and tissue samples from spleen, large intestine and small 
mtestine were collected. Tissues were flushed in NaCl and then quick-frozen in liquid 
nitrogen before RNA preparations. 
RNA preparations 

RNA from approximately 50 mg of tissue from spleen, large intestine and small 
intestine was prepared using the TRIZOL method (Invitrogen) according to the 
manufacturer's instructions. After DNase treatment, cDNA synthesis was performed using 
Superscript First Strand Synthesis Kit for RT-PCR (Invitrogen Life Technologies). 
Real time PCR 

Keai-tnne FCR-experiments where performed on Applied Biosystems 7900 HT in 384 
format with Syber Green chemistry (double-stranded DNA binding dye, minor groove 
binding) and fluorescent probes. In order to detect any unspecific amplification and melting, 
curve analysis was performed after each completed PCR. 

Each sample was run in triplicate using 3ng of template for each reaction (lOjxl). The 
15 PCR reactions were run at 40 cycles for both sample and reverse transcriptase negative 

controls. Non-template controls where also included to confirm that the signals were not due 
•to the primers themselves. The primers, where designed from cDNA sequences in the program 
Primer Express, manufactured by Applied Biosystems. The primers were all complementary 
to the intron-exon junction. Acidic ribosomal phosphoprotein P0 (m36b4) was used to 
20 normalize for differences in RNA input and cDNA synthesis. 

The relative expression in each tissue was calculated using the formula ACt=C t DLG5 
-Ct 36B4 where C T is defined as the fractional cycle number at which the fluorescence passes 
a threshold above baseline. Relative values for dlg5 in distinct tissues was calculated using 
the comparative (AACT) method, where the ACT value from spleen was set as reference (User 
25 Bulletin#2,ABI PRISM 7700 Sequence Detection System). 

C T dlg5 = the threshold cycle for the dlg5 gene in the tissue of interest. 
C T 36B4 = the threshold cycle for 36b4 in the same tissue. 

- -ACt=C t DLG5-C T -36B4 - . 

30 AACT=ACT(tissue of interest)-ACT(spleen) 

The primers used for the analyses of murine dlg5 (exon 29-30) were forward: 
CAGAAAAGAACCGGCACTGTCT (SEQ ID NO: 183), reverse: 
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TGTGGTGCAGCCTCTCGAT (SEQ ID NO: 184) and probe sequence: 

CTGGACATCGCCCCGCATGC (SEQ ID NO: 185) 

As endogenous control, the mouse gene for acidic ribosomal phosphoprotein 
P0(m36b4) was analysed using following primer sequences, forward: GAG GAA TCA GAT 
5 GAG GAT ATG GGA (SEQ ID NO: 186) and reverse: AAG CAG GCT GAC TTG GTT GC 
(SEQ ED NO: 187). For detection of m36b4 a fluorescent probe with the sequence TCG GTC 
TCT TCG ACT AAT CCC GCC AA (SEQ ID NO: 188) was used. 
Results 

The expression levels of dlg5 was analysed in samples from large (colon) and small 
10 intestine (terminal ileum) from C57BL/6 mice in acute and recovery phase of DSS induced 
colitis. Interestingly, only samples from large intestine showed significant variations between 
different phases of disease progression (Figure 2). During the acute phase of colitis, the 
expression level of dlg5 in large intestine was found to be significantly lower than that found 
in corresponding samples from control animals or animals in recovery phase. Also, in large 
15 intestine an increased expression level during the recovery phase was detected. No significant 
differences in dlg5 expression levels could be detected when samples from small intestine 
during different phases of disease progression were analysed. As the intestinal inflammation 
is limited to the large intestine in this model, the expression pattern suggests a 1ir»V to the 
colon and/or inflammation. 
20 In summary 

We have found variations in the expression of dlg5 in large intestine during distinct 
phases of colitis in mice. These results, from an animal model, strengthen the role for DLG5 
during colitis disease progression. 
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Claims; 

1. A method for identifying a compound capable of modulating the action of the DLG5 
protein which method comprises subjecting one or more test compounds to a screen 

5 comprising a polypeptide containing the amino acid sequence shown in SEQ ID NO: 2, or a 
homologue thereof or a fragment of either. 

2. A method for identifying potential anti-IBD therapeutic compounds comprising 
contacting an assay system capable of detecting the effect of a test compound against 

-10-e^sion-levet^L-eSTwiuT^ t u )l npouud and assessing the change in expression level 
of DLG5. ...^ 

3. A method of screening for a compound potentially useful for treatment of IBD, which 
comprises assaying the compound for its ability to modulate the activity or amount of DLG5. 




15 



4. The method according to claim 3, which involves a method selected from: 

(i) measurement of DLG5 activity using a cell line which expresses the DLG5 polypeptide or 
using purified DLG5 polypeptide; and 

(ii) measurement of dlg5 transcription or translation in a cell line expressing the DLG5 
20 polypeptide. 

* 

5. A cell comprising a reporter gene under the control of the dlg5 promoter. 

6. A method of testing potential therapeutic agents for the ability to suppress IBD 

25 phenotype comprising contacting a test compound with a cell engineered to express the DLG5 
polypeptide; and determining whether said test compound modulates expression of the DLG5 
polypeptide. 

7 ; _ _AjaethodJb^ 
30 contacting a potential therapeutic agent with a cell or cell line as described above and 

determining inhibition of dlg5 transcription by the potential therapeutic agent by reference to 
a lack of or reduction in expression of the reporter gene. 
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8. A method of preparing a pharmaceutical composition comprising: 

i) identifying a compound as useful for treatment of IBD according to a 
screening method as described herein; and 

ii) mixing the compound or a pharmaceutical^ acceptable salt thereof with a 
5 pharmaceutical^ acceptable excipient or diluent. 

9. Use of a compound able to modulate the activity or amount of DLG5 in the 
preparation of a medicament for the treatment of IBD. 

10 10. A method for the diagnosis of IBD or determining susceptibility to develop IBD, 
which method comprises: 

(i) obtaining a protein or nucleic acid -containing sample from an individual; 

(ii) detecting the presence or absence of a variant DLG5 on the basis of the 
presence of a polymorphic amino acid within the DLG5 protein, or a 

Z 15 polymorphic base within the dlg5 gene sequence; and, 

(iii) determining the status of the human by reference to the presence or absence of 
a polymorphism in DLG5. 
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Sequence lisi-.^g, 

TaS^ZV p S ^° WS " 7430 PairS nucle °tide sequence of an alternative 

dig cDNA. Predicted start and stop codons are underlined. The first 328 
nucleotides are derived from the novel 5' exon 

SEQ m NO :2 3 S T S ^ DLG5 P6Ptide SeqUSnCe ° f 1919 — • -ids 

SEQ^ID NO: 3 shows the sequence of the novel 5- sequence identified for 

SEQ ID 5 T Pr ° POSed 5 ' SeqUenCe fr ° m AF3 "034. 

ID N ° : 5 shows the Putative promoter sequence for dlgS 
SEQ ID NO: 189 shows the 7269 base pairs nucleotide sequence of N- 
termxnally truncated dlgS cDNA. Predicted start and stop codons are 
under llned . The firsfc nucleQtides m ^ ^ 



SEQ ID NO: 190 shows, the predicted N-terminally truncated DLG 5 peptide 
sequence of 1809 amino acids derived from SEQ ID NO: 189. 



SEQ ID NO : 1 



VGTGT 



\CAAGTTGGAGGTG 
rCGCCATCCACAAT 
rGCTGAAGCAGACA 
VGTGCGCCCTCTCC 

:gcagaacaaggac 

3AACCACGCAGGTG 




^^^^ 



\GACAAG 
3CGGGAC 
3CAGAAG 



CGGAACTTGATACAGCAC^TAACTCCAOGCAGACAGACATCTT^ 
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GAAGACAGGAAGGAGCGAGGCCCCCG&GGAG 
CCTGGGGGACCCTTGCAOGTCTGCCCCC&GGCCTO^ 
AGCTCCTTCCGCTCAGATGCCTCTGGGGACCGTGGCTTTGC^ 
CGGCXSGCCACTGCTGCCCTTTGAGACCGAGGTGGGCCCCTGTGGGGTTGGG^ 
5 CTGGACAAGGCAGACTCTGAAGGCTCCAACAGCGGCGGGACCTGG 

AGCTCCACGGCAGTGCCTGAGAAGCTCTCTGTTTATAAAAAGCCAAAGCAAAGAA^ 
ATCTTTGACCCTAACACTTTCAAACGCCCCCAGACACCCCCCAAAATAGACTACCTC 
CCAGGTCCTGGGCCTGCTCACTCTCCCCAGCCCTCCAAGAGGGCGGGGCCTCTGACACCC 
CCAAAACCTCCCAGAAGGAGCGACTCCATTAA<5TTCCAGCACAGGCTGGAGACTAGCTCC 
1 0 <3AGTCAGAAGCCACTCTGGTGGGCAGCTCCCCATCCACTAGTCCCCCGAGCGCCCTGCCC 
CCTGACGTGGACCCCGGGGAGCCCATGCACGCATCACCCCCTCGCAAGGCCAGGGTCCGC 
ATTGCTTCCAGCTACTACCCTGAAGGAGATGGGGACTCCTCCCACCTGCCGGCCAA<3A^ 
TCCTGTGATGAGGACCTCACCTCCCAGAAGGTGGATGAGCTGGGGCAGAAGCGT 
CCAAAATCTGCTCCCAGTTTTCGGCGGAA<3CTTGCTCCAGTAGTGATTCCTC 
1 5 CTGGAGGAACAGAAGTGTGTCCCGGCCAGTGGAGAACTCTCCCCGGAGCTCCAGGAGTGG 
GCACCTTACTCGCCTGGGCATTCCAGCCGGCACAGCAACCCCCCGCTATACCCTAGCAGG 
CCGTCTGTGGGCACTGTTCCCCGGAGTTTGACCCCCAGCACCACTCTGAGCTCCATCCTC 
CGGAACCCCATCTACACTGTGCGCAGTCACAGGGTCGGCCCCTGCAGCTCTCCACCTGCG 
GCCCGAGATGCTGGCCCCCAGGGTTTGCATCCCAGTGTCCAGCACCAGGGACGCCTGAGC 
20 CTGGACCTGAGCCACAGGACCTGCAGCGACTACTCCGAGATGAGAGCCACCCATGGGTCC 
AACTCACTGCCCTCCAGCGCCCGCCTGGGTTCTTCGAGTAACTTGCAGTTCAAGGCGGAA 
CGCATTAAAATCCCATCAACACCAAGATATCCGGGGAGTGTCGTGGGCTCCGAGAGAGGT 
TCAGTGTCACATTCTGAATGCAGCACTCCTCCACAGTCACCCCTGAACATCGACA 
TCCTCTTGTAGCCAGTCCCAGACCTCAGCCTCCACATTGCCCAGAATCGCTGTCAACCCC 
25 GCGTCCCTCGGGGAGCGGAGAAAGGACAGGCCTTATGTGGAGGAGCCACGCCACGTGAAG 
GTGCAGAAGGGCTCAGAGCCGCTGGGCATCTCCATCGl'GAGTGGAGAGAAGGGCGGCATC 
TACGTCTCCAAGGTGACCGTGGGGAGCATCGCTCACCAGGCTGGCCTCGAGTATGGGGAT 
CAGTTACTGGAGTTCAACGGCATAAACCTGCGGAGGGCCACGGAGCAGCAGGCGCGGCTC 
ATCATCGGGCAGCAGTGTGATACCATCACCATCCTGGCCCAGTACAACCCCCACGTGCAC 
30 CAGCTCAGCAGCCACTCCCGGTCCAGCTCACACCTGGACCCTGCCGGTACCCACTCCACT 
CTCCAGGGCAGTGGCACCACCACCCCGGAGCATCCATCTGTCATCGACCCACTGATGGAG 
CAGGACGAGGGGCCTAGCACCCCCCCAGCCAAGCAGAGCAGCTCCAGGATTGCGGGAGAT 
GCCAACAAGAAGACCCTGGAGCCACGCGTTGTCTTCATCTUUU^GTCCCAGCTGGAGCOT 
GGGGTGCACTTGTGTGGTGGGAACCTGCATGGGGTGTTTGTGGCCGAGGTGGAGGATGAC 
35 AGTCCTGCCAAGGGTCCTGACGGCCTCGTGCCAGGGGACCTCATCCTGGAGTATGGCAGC 
CTGGACGTGCGGAACAAGACAGTGGAGGAAGTCTATGTGGAGATGCTGAAGCCCAGGGAT 
GGCGTCCGCCTGAAGGTGCAGTACCGCCCTGAGGAGTTCACGAAGGCCAAGGGCCTGCCT 
GGTGACAGCTTCTACATCAGGGCCCTGTAGGACCGGCTGGCAGATGTGGAGCAAGAGTTG 
AGCTTTAAGAAGGACGACATCCTCTACGTGGATGACACCTTACCCCAGGGCACGTTCGGG 
40 TCCTGGATGGCTTGGCAGCTGGACGAGAATGCCCAGAAGATCCAGCGCGGGCAGATTCCC 
AGCAAATATGTGATGGACCAAGAATTCTCCAGGAGGCTCAGCATGTC 
GACAATAGCGCCACAAAGACGCTGTCAGCGGCTGGACGCCGGTCCTTTTTTCG 
CACAAGCACAAACGGAGCGGGTCCAAGGACGGGAAAGACCTGCTCGCCTTGGATGCCTTT 
TCCAGTGACTCCATTCCACTCTTTCAAGATTCGGTGAGCCTGGCCTATCAGCGGGTCCAG 
45 AAGGTGGACTGCACCGCTCTGAGGCCTGTCCTGATTCTGGGGCCTTTC 

AAGGAGATGCTGGTGAATGAGGCTCCTGGCAAGTTCTGCAGATGTCCCCTTC 
AAGGCCTCCCAGCAGGCCATTGAGCGGGGTGTCAAAGATTGCCTGTTTGTCGACTATAAG 
CGGAGAAGCGGCCATTTCGATGTGACC^CTGTGGCGTC^TAAAGGAGATCACAGAAAAG 
AACCGAGACTGCCTCCTGGACATTGCTCCGCACGCTATTGAGCGGCTCCACCACATGCAC 
50 ATCTACCCCATTGTCATCTTCATCCACTACAAGAGCGCCAAGCACATCAAGGAGCAGAGA 
GACCCCATCTACCTGA<3GGACAAGGTGACTCAGAGGGATTCCAAAGAGCAGTTTGAGGCG 
GCGCAGAAGCTTGAGCAGGAGTACAGCAGGTACTTCACAGGGGTCATCCAGGGAGGAGCC 
CTGTCAAGCATTTGCACTCAGATCTTGGGAATGGTCAAT^ 

TGGATTCCAGCCTGCCCGCTCT^GAGAATGCTGTGCTGTGGATGACTGCAGCTGGCCGC 
55 CTGAGGGGACACCAGACTCAGCTCTTTTCTAGCGACTGAAAGTAGAAGTCTGTCCGTCTA 

TGAACATGCGGGGGAAGGATCCGGAACCAGGACCCAGAAGCACCTCCTTTGTAGACAGAG 

GGCCACGGCTGCGTGCGATCCAGGCCCAGGCCCACACACTCTGCCCGTGTCACACGTGTG 

CTTTAACACAAAACAGATAACACTAAAGACGGGTTCAGCACCCACCTTTCTTTAGCCAGC 

TGATCAGAGATGCTGCAAAGAGAACCTTTCGGATCACTCGTTT^ 
60 ATTTGGTGGTTTATGTTTACTTGAACGGCTCCATGTTG 

TCTGTCAACCCCCTGTCGCTTTGGTGTTCGTTTC 

• GGAACCTCAATGGGGGCTGCTTTGCTTTGGGAGGTTCTTGTTG 

ACAAACCTCCTGCTCCTTGGTGGCACCTCTCCTGGAAGGACGTCACAACTCCAGGTGCTC 
AGACTGCCTGTGGCAGCAGAACCAGTCCCTTTGGCATTTTC 
65 GACTTTGGCATTCTTACAAACTCX5TCTC 

CGTTTCTCCTCCATCTGTGCTTATTACTTGAGGACTGTGTCTGCTCCGTC 
GGGCAGGGCTGCAGTGGGGTCCAGGTGGTGTTCAGCTGTGCTGATGCCTGCCATTC 
CTCCTTAGGCTCTGTAAGTCGTGACAGCCTTCATCAGTGGAATGTTTC 
TAAACTTTTTAGA<^GTGGCAGGTACATCAGTTCTTTTTGATATC 
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50 SEP Tn Mn 7 t 
55 SEP ID INTO- 21 

SEP Tn mo. 

gtcttgctctggc^ggcgcgccgcccggctttc^ggcgagt™ 

BEST AVAILABLE CPPY 
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CTGAGGGGGAGGGGGAGCATGCCCGGCTGGAGCCCGGCCTCGGGGCGCGCCCCX^ 

TCCTCCTTCTCCTCCTCCGTCTCCTCCTCCTCCTCTTCCAGTC 

CX3GAGCCCX3GGAGGC{X3CCGCTGCTCAGCGCCGCCC^CGCC^ 

GGCTCCCGCCGCCTGACGGCTCCCGGCGCCCTAGGGCTGCTGCCGAGGTGCTCGCCTG 
5 GGGGGCCCGGGAGGCGTCC 

SEQ JD No? 3,89 

GAGGCGK3GAGGCGCCAAGGCX3GAGCTGCTGCTCAAGCTGCTCTTGGCCAAGGAG 
CGGGCGGCGCTGGAGAAGACGCAGCCTCACCTGCTGCCCATTCTCTACCTC 
10 CCCGCCGAAGGCGCGGGTTCTACX:TACAGCK3TCCTOTCC^CC ATGC CCTCAGACTC 
AGCAGTGTGGGGACTACCGGGAAGGCGCCGTCCCCACC^ 

GAGAACCTCTCCATTCAGCTGCGGCTGATGACCCGGGAGAGAAACGAGCTCC<3CAAGGGCCTGGCCTT^ 
CATGGCACGGCCTTTGACAAGAGGCCCTACCACAGGCTGAATC^ 

CGAGCCATGTCGGACCTGCAGAGCCTGCAGAAOCAGCACACCAACGCCTTGAAGAGGTG 
1 5 ^AGACTGACTTCTACCAC^CACTCCACAGCCGGCTCCTGAGTGACCAGACTCX3GCTO 
CTGAGGCGGGAGAATGGGCAGCTGCTGCGGGAGCGAAACCTGCTGCAGGA^^ 

CACGAGGAGGACCAGAAGGAGATCGGTGACCTCCGTGCCCAGCAGCAGCAGGTGTTGAAGCACAA^ 

GAGATTCTCAACAAACTGTATGACACGGCCATGGACAAGTTGGAG 

AAGAGGTACAGTGAGAAAGTCGCCATCCACAATGCAGACCTGA 
20 CGGTTGCTGAAGCAGACAGAGATGCTGACCCAGCAGAGGGACACGGCCATCCAGCTGCAGCACCAGTGCG 

TCCCTGAGGAGGTTTGAGGCGATCCACCATGAGCTGAACAAGGCCACGGCGCAGAAC 

ATGGAGCTGCTGCAGTCAGAGCTGACCGAGCTGAGAACCACGC^GGTGAAGACAGCyU^GGAGTC 

AGGGAGGAGGGGGACGCTGTGTACAGCGAGTAGAAGCTCATCATGAGTGAGCGTC 

GACAAGCTGCAGACCGAAGTGGAGCTGGCCGAGTCCAAGCTCAAGAGGAGCACATCTGAGAAGJ^ 
25 GAGGAGATGGAGGCGCTGCGGC^GATCAAAGA<^CGGTGAC^TGGATGCTC^ 

ATCCTTCGAAAGCAGTGCAAGGCTCTGTGCCAGGAGCTGAAGGAAGCCCTCCAGGA 

CGGCGGGACTGGGCCTTCCAGGAGCGAGACAAGATTGTAGCAGAGCGTGAC 

CTGAGGCGGGAGGGGGACCGTGCGGTGAGCGAGCTGGCTGAGGCCCTGCGCAGCCTGGATGACACCCGCAAGCAG 
AAGAATGATGTGAGCCGCGAGCTGAAGGAGCTCAAGGAACAGATGGAATCCCAGTTGGAAAA 
30 CGACAGCTGATGGCCCACAGCTCCCACGACTCGGCCATTGACACX3GATTCCATGGAGTCG 
GAGTTCGAGAGGGAGACGGAGGATATTGACTTGAAGGCACTGGGGTTTGATATGGCAGAA 
TGTTTCCCGGGGGACTGTGGCATATTTGTCACTAAAGTGGACAAAGGAAGCAT 

AATGACTGGCTGCTGAGAATCAACGATGTGGACCTCATCAACAAGGACAAGAAGCAGGCCATCAAGG 

AATGGGGAGGGGGCCATCAACATGGTCGTGCGGCGGAGGAAGTCCCTGGGTGGGAAGGTGGTC 
35 ATCAACCTCAGTCGACAGAAAGACAGTGGCATCAGTCTGGAGAATGGAGT 

AOCCCTGCCGCTAAAGAAGGGTCCCTTGCTGTGGGAGACAGGATC^ 

AAGTCTCTGAATGAATGTGAATCTCTGCTGCGGAGCTGCCAGGACTCCCTG 

CCTCAGAGCTCCTC<3TGGAGTGGCCAGAACATTTTTGAAAATATCAAAGA^ 

GCCCATGGCCCGGAGGTCCAGGCTCATAACAAACGGAACTTGATACAGCACAATAACTCCAC^ 
40 TTCTACACGGACAGGCTGGAAGACAGGAAGGAGCCAGGCCCCCCAGGAGGCAGCAGCTCCTTTCTGCA^ 

TTCCCTGGGGGACCCTTGCAGGTCTGCCCCCAGGCCTGTCCCAGTGCCTCTGAGCX3TAGCCTO 

TCAGATGCCTCTGGGGACCGTGGCTTTGGGCTGGTCGACGTGCGTGGCCGGCGG 

GAGGTGGGCCX^CTGTGGGGTTGGGGAGGCCTCCCTGGACAAGGCAGACTCTC 

TGGCCCAAGGCCATGCTCAGCTCCAGGGCAGTGCCTGAGAAGCTCTCTGTTTATAAAAAGC 

45 tccatctttgaccctaacactttcaaacgcccccagacaccccccaaaatagactacc 
cctgctcactctccccagccctcx:aagagggcggggcctctgac^^ 
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ATTAAGTTCCAGCA^^ 

GACCTCACCTCC(^GAAGGTGGATGAGCTGGGGCAGAAGCGTCGCCGGCCAAAATCTGCTCC 
AAGCT^crcCAGTAG*^ 

AGGCCGTCTGTGGGCACTGTTCCCCGGAGTTTGACCCCCAGCAC<^CTGTGAGCT 
CATCCCAGTGTCCAGCACCAGGGACGCCTGAGCCTGGACC^rcAGCCACAGGACCTC 

AGAG T CCCA ^ 

TCTGAATGC^GCACTCCTCCACAGTCACCCCTGAACATCGACACCCTGTCCTCTTGTAGCGAGTCCCAG 




GAGCCACGCCACGTGAAGGTCCAGAAGGGCTCAGAGCCGCTCGGCATCTCCATCGTGAG^^^ 
ATCTA^TCTCC^GG^^ 

TTCAACGGCATAAACCTGCGGAGCGCGACGGAGCAGCAGGCGCGGCTC^ 

ACCATCCTGGCCCAGTAGAACCCCCACGTGCACCAGCTCAGCAGCCACTCCCGGTCCAGCTCACACCT 
GCCGGTACCCACTCCACTCTCCAGGGCAGTGGCACCACCACCCCGGAGCATCC^TCTGTCATCGAC 

««««^^ 

ACCCTGGAGCCACXJCGTTGTCTTCATCAAAAAGTC 

A !^r AGTATCGCAGcc ^ AcGTCcGGAA ^ 

TACA^GGGCCCTGTAC^CCGGCTGGCAGATOTGGAGCAAGAGT^AGC^ 

G ?^ A r AcA ^^ 

^ G ^^ 

GATTCGGTGAGCCTGGCCTATCAGCGGGTCCAGAAGGTGGACTGCACCGCTCTGAGGCCTGTCCTGATTCT^ 
CCTTTGCTGGACGTGGTGAAGGAGATGCTGGTGAATGAGGCTCCTGGCAAGTTCTGCA 

^^ccagcaggcca^agcg 

CATTTCGATGTGACCACTGTGGCGTGAATAAAGGAGATC^CAGAAAAGAACCGAGA^ 

GCGGCGCAGAAGCTTGAGCAGGAGTACAGCAGGTACTTCACAGGGGTCATCCAGGGAGGAGCCCTGTCAAGCATT 

iZ;r GATC ™ G ^^ 

TAGAAG^TCCG^TATOAACA^CGGGGGAAGGA^ 

GAGGGCGACXSGCTGCGTGCGATCCAGGCCCAGGCCCACACACTCTGCCCGTGTCACACGTGTGCTT^^ 
TTC^ATCACTCGTTTACAAGCCTTTTCTAAGTATTTGGTGQTTTATGTTTA 
CTTGGAACCTCAATGGGGGCTGCTTTGCTTTCGGAGGTTCTTGTTGGTGGGA 
CCTTTGGCATTTTCCTCCCACAATGGGGAAGGTGACTTT^ 

BEST AVAILABLE COPY 
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CGTGGGCAGGGCTQCA<3TGGGGTCCAGGTGGTGTTCAGCTC 
TGTAAGTCX3TGACAGCCTTCATCAGTCCAATC 
TCAGTTCTTTTTGATATGAAAACATTCATC 
TTGTCACTATCAACAGTCTAAAAAGAAAAACTGAGG^ 
5 TACTCAGTGTGGGTGTCTTCGTTTGGGGGCTTTT^ 
ATTTGATGTGGTTCCTCCTTAAGCTATGC^ 
TGGGGTAGGACCTATATTTTAATACTGTTCCTAACATTTCATT 
TCTTTGTAATTCTAGACACTAGATTGTAGTTTAGCCATAACTGATC 
CAGTTGTTCAAAAAAGAGACAAGTTTCAGTC^ 

1 0 GACAAACTATGAAAAACTGTAGACTATTCTCAAGGTATTAACTCGCAGACCCTCTC 
AAGTTACAGGCAGAGTGGGACTGAGATGGTACAGTGTGCACAGACAGGTACTO 
GTACTAAAATGTTACTTTGTATAAAAGTT^ 

SEP ID NO ; 190 
1 5 MPSDSESSSSLSSVGTTGKAPSPPPLLTDQQWnEa^ 

PDYERLKIQCVRAMSDLQSLQNQHTNALKRCEEVAKETDFYHTL 

LQQSV^MKRLHEEDQKEIGDLRAQQQQVL^^ 

SRLEQLGEENQRLLKQTEMLTQQRDTAIQLQHQ^^ 

QVKTAKE S EKYREERDAVYS EYKL IMSERDQVT s ELDKLQTEVELAE SKLKS S TSEKKAANEEMEALRQ IKDTVT 
20 ^AGRAlsTKEVEILRKQCKALCQELK^ 

ALRSLDDTRKQKNDVSRELKELKEQMES QLEKEARFRQLMAHS SHDSAIDTDSMEWETEVVEFERETEDIDLKAL 
GFDMAEGVNEPCFPGDCGIFVTK^ 

SLGGKWTPLHINLSGQKDSGISLENGWAAAVLPGSPAAK^^ 
DSLTLSLLKVFPQSSSWSGQNIFENIKDSD^ 

25 PGGSSSFLHKPFPGGPL'QVCPQACPSASERSLSSFRSDASGDRGFGLVDWGRRPLLPFETEVGPCGVGEASLDK 
ADSEGSNSGGTWPKAMIiSSTAVPEKLS VYKKPKQRKS IFDPNTFKRPQTPPKIDYLLPGPGPAHS PQPSKRAGPL 
TPPKPPRRSDSIKFQHRLETSSESEATLVGSSPSTSPPSALPPDVDPGEPMHASPPRKARVRIASSYYPEGDGDS 
SHLPAKKSCDEDLTSQKVDELGQKRRRPKSAPSFRPKLAPVVIPAQFLEEQKCVPASGEL 
SRHSNPPLYPSRPSVGTVPRSLTPSTWSSILRNPIYTVRSHRVGPCSSPPAARDAGPQGLHP^^^ 

30 SHRTCSDYSEMRATHGSNSLPSSARLGSSS^QFKAERIKIPSTPRYPRSWGSERGSVSHSECSTPP 
TLSSCSQSQTSASTLPRIAVNPASLGERRKDRPYVEEPRHV^ 

AGLEYGDQLLEFNG INLRSATEQQARL I IGQQCDTITILAQYNPHVHQLS S HSRS S SHLDPAGTHSTLQGSGTTT 
PEHPSVIDPLMEQDEGPSTPPAKQSSSRIAGDANKKTL^ 
KGPIXSLVPGDLILEYGSLDVRNKTVEEVYVEMLKPI^ 
35 £LS FKKDDILYVDDTLPQGTFG SWMAWQLDENAQKIQRGQI PSK YVMDQEF SRRLSMSEVKDDNS ATKTLSAAAR 
RSFFRRKHKHKRSGSKDGKDLI^tfiD^ 

pgkfcrcplevmkasqqaiergvkix:lfvdykrrsghfdvt^ 

ivifihyksakhikeqrdpiylrdkvtqrhskeqfeaaqkleqeysryftc 
vlwipacpl 
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